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The present invention provides improved adenovirus vectors and packaging cell lines. One type of improved adenoviral vector 
comprises deletions within the E2b region of the adenoviral genome. These E2b-deleted virus are used in conjunction with novel cell 
lines that constitutively express E2b gene products. The present invention further provides adenoviral vectors deleted for all viral coding 
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IMPROVED ADENOVIRUS VECTORS 

■ 

FIELD OF THE INVENTION 

The invention relates to improved adenovirus vectors, and more specifically, 
5 adenovirus vectors useful for gene therapy. 

BACKGROUND 

Adenoviruses (Ad) are double-stranded DNA viruses. The genome of adenoviruses 
(-36 kb) is complex and contains over 50 open reading frames (ORFs). These ORFs are 

1 0 overlapping and genes encoding one protein are often embedded within genes coding for other 
Ad proteins. Expression of Ad genes is divided into an early and a late phase. Early genes 
are those transcribed prior to replication of the genome while late genes are transcribed after 
replication. The early genes comprise Ela, Elb. E2a, E2b. E3 and E4. The Ela gene 
products are involved in transcriptional regulation; the Elb gene products are involved in the 

15 shut-off of host cell functions and mRNA transport. E2a encodes the a DNA-binding protein 
(DBP); E2b encodes the viral DNA polymerase and preterminal protein (pTP). The E3 gene 
products are not essential for viral growth in cell culture. The E4 region encodes regulatory 
protein involved in transcriptional and post-transcriptional regulation of viral gene expression: 
a subset of the E4 proteins are essential for viral growth. The products of the late genes (e.g., 

20 LI -5) are predominantly components of the virion as well as proteins involved in the 
assembly of virions. The VA genes produce VA RNAs which block the host cell from 
shutting down viral protein synthesis. 

Adenoviruses or Ad vectors have been exploited for the delivery of foreign genes to 
cells for a number of reasons including the fact that Ad vectors have been shown to be highly 

25 effective for the transfer of genes into a wide variety of tissues in vivo and the fact that Ad 
infects both dividing and non-dividing cells; a number of tissues which are targets for gene 
therapy comprise largely non-dividing cells. 

The current generation of Ad vectors suffer from a number of limitations which 
preclude their widespread clinical use including: 1) immune detection and elimination of cells 

30 infected with Ad vectors. 2) a limited carrying capacity (about 8.5 kb) for the insertion of 
foreign genes and regulatory elements, and 3) low-level expression of Ad genes in cells 
infected with recombinant Ad vectors (generally, the expression of Ad proteins is toxic to 
cells). 
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The latter problem was thought to be solved by using vectors containing deletions in 
the El region of the Ad genome (El gene products are required for viral gene expression and 
replication). However, even with such vectors, low-level expression of Ad genes is observed. 
It is now thought that most mammalian cells contain El-like factors which can substitute for 
the missing Ad El proteins and permit expression of Ad genes remaining on the El deleted 
vectors. 

What is needed is an approach that overcomes the problem of low level expression of 
Ad genes. Such an approach needs to ensure that adenovirus vectors are safe and non- 
immunoaenic. 

t 

SUMMARY OF THE INVENTION 

The present invention contemplates two approaches to improving adenovirus vectors. 
The first approach generally contemplates a recombinant plasmid. together with a helper 
adenovirus, in a packaging cell line. The helper adenovirus is rendered safe by utilization of 
loxP sequences. In the second approach, "damaged" adenoviruses are employed. While the 
"damaged" adenovirus is capable of self-propagation in a packaging cell line, it is not capable 
of expressing certain genes (e.g., the DNA polymerase gene and/or the adenovirus preterminal 
protein gene). 

In one embodiment of the first approach, the present invention contemplates a 
recombinant plasmid. comprising in operable combination: a) a plasmid backbone, 
comprising an origin of replication, an antibiotic resistance gene and a eukaryotic promoter 
element: b) the left and right inverted terminal repeats (ITRs) of adenovirus, said ITRs each 
having a 5* and a 3* end and arranged in a tail to tail orientation on said plasmid backbone: 
c) the adenovirus packaging sequence, said packaging sequence having a 5" and a 3' end and 
linked to one of said ITRs: and d) a first gene of interest operably linked to said promoter 
element. 

While it is not intended that the present invention be limited by the precise size of the 
plasmid. it is generally desirable that the recombinant plasmid have a total size of between 27 
and 40 kilobase pairs. It is preferred that the total size of the DNA packaged into an EAM 
derived from these recombinant plasmids is about the length of the wild-type adenovirus 
genome (-36 kb). It is well known in the art that DNA representing about 105% of the wild- 
type length may be packaged into a viral particle: thus the EAM derived from recombinant 
plasmid may contain DNA whose length exceeds by -105% the size of the wild-type genome. 
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The size of the recombinant plasmid may be adjusted using reporter genes and genes of 
interest having various sizes (including the use of different sizes of introns within these genes) 
as well as through the use of irrelevant or non-coding DNA fragment which act as "stuffer" 
fragments (e.g.. portions of bacteriophage genomes). 

In one embodiment of the recombinant plasmid, said 5" end of said packaging 
sequence is linked to said 3 * end of said left ITR. In this embodiment, said first gene of 
interest is linked to said 3' end of said packaging sequence. It is not intended that the present 
invention be limited by the nature of the gene of interest; a variety of genes (including both 
cDNA and genomic forms) are contemplated; any gene having therapeutic value may be 
inserted into the recombinant plasmids of the present invention. For example, the transfer of 
the adenosine deaminase (ADA) gene is useful for the treatment of ADA- patients: the 
transfer of the CFTR gene is useful for the treatment of cystitic fibrosis. A wide variety of 
diseases are known to be due to a defect in a single gene. The plasmids. vectors and EAMs 
of the present invention are useful for the transfer of a non-mutated form of a gene which is 
mutated in a patient thereby resulting in disease. The present invention is illustrated using 
recombinant plasmids capable of generating encapsidated adenovirus minichromosomes 
(EAMs) containing the dystrophin cDNA gene (the cDNA form of this gene is preferred due 
to the large size of this gene): the dystrophin gene is non-functional in muscular dystrophy 
(MD) patients. However, the present invention is not limited toward the use of the dystrophin 
gene for treatment of MD: the use of the utrophin (also called the dystrophin related protein) 
gene is also contemplated for gene therapy for the treatment of MD [Tinsley et al. (1993) 
Curr. Opin. Genet. Dev. 3:484 and (1992) Nature 360:591]; the utrophin gene protein has 
been reported to be capable of functionally substituting for the dystrophin gene [Tinsley and 
Davies (1993) Neuromusc. Disord. 3:539]. As the utrophin gene product is expressed in the 
muscle of muscular dystrophy patients, no immune response would be directed against the 
utrophin gene product expressed in cells of a host (including a human) containing the 
recombinant plasmids. Ad vectors or EAMs of the present invention. While the present 
invention is illustrated using plasmids containing the dystrophin gene, the plasmids. Ad 
vectors and EAMs of the present invention have broad application for the transfer of any gene 
whose gene product is missing or altered in activity in cells. 

Embodiments are contemplated wherein the recombinant plasmid further comprises a 
second gene of interest. In one embodiment, said second gene of interest is linked to said 3* 
end of said right ITR. In one embodiment, said second gene of interest is a reporter gene. A 
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variety of reporter genes are contemplated, including but not limited to E. coli p-galactosidase 
gene, the human placental alkaline phosphatase gene, the green fluorescent protein gene and 
the chloramphenicol acetyltransferase gene. 

As mentioned above, the first approach also involves the use of a helper adenovirus in 
combination with the recombinant plasmid. In one embodiment, the present invention 
contemplates a helper adenovirus comprising i) first and a second loxP sequences, and ii) the 
adenovirus packaging sequence, said packaging sequence having a 5" and a 3' end. It is 
preferred that said first loxP sequence is linked to the 5" end of said packaging sequence and 
said second loxP sequence is linked to said 3' end of said packaging sequence. In one 
embodiment, the helper virus comprises at least one adenovirus gene coding region. 

The present invention contemplates a mammalian cell line containing the above- 
described recombinant plasmid and the above-described helper virus. It is preferred that said 
cell line is a 293-derived cell line. Specifically, in one embodiment, the present invention 
contemplates a mammalian cell line, comprising: a) a recombinant plasmid. comprising, in 
operable combination: i) a plasmid backbone, comprising an origin of replication, an 
antibiotic resistance gene and a eukaryotic promoter element, ii) the left and right inverted 
terminal repeats (ITRs) of adenovirus, said ITRs each having a 5' and a 3' end and arranged 
in a tail to tail orientation on said plasmid backbone, iii) the adenovirus packaging sequence, 
said packaging sequence having a 5' and a 3' end and linked to one of said ITRs. and iv) a 
first gene of interest operably linked to said promoter element: and b) a helper adenovirus 
comprising i) first and a second loxP sequences, and ii) the adenovirus packaging sequence, 
said packaging sequence having a 5" and a 3' end. As noted previously, said helper can 
further comprise at least one adenovirus gene coding region. 

Overall, the first approach allows for a method of producing an adenovirus 
minichromosome. In one embodiment, this method comprises: A) providing a mammalian 
cell line containing: a) a recombinant plasmid. comprising, in operable combination, i) a 
plasmid backbone, comprising an origin of replication, an antibiotic resistance gene and a 
eukaryotic promoter element, ii) the left and right inverted terminal repeats (ITRs) of 
adenovirus, said ITRs each having a 5' and a 3' end and arranged in a tail to tail orientation 
on said plasmid backbone, iii) the adenovirus packaging sequence, said packaging sequence 
having a 5" and a 3" end and linked to one of the ITRs. and iv) a first gene of interest 
operably linked to said promoter element; and b) a helper adenovirus comprising i) first and a 
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second loxP sequences, ii) at least one adenovirus gene coding region, and iii) the adenovirus 
packaging sequence, said packaging sequence having a 5" and a 3' end; and 
B) growing said cell line under conditions such that said adenovirus gene codine region is 
expressed and said recombinant plasmid directs the production of at least one adenoviral 
minichromosome. It is desired that said adenovirus minichromosome is encapsidated. 

In one embodiment, the present invention contemplates recovering said encapsidated 
adenovirus minichromosome and. in turn, purifying said recovered encapsidated adenovirus 
minichromosome. Thereafter, said purified encapsidated adenovirus minichromosome can be 
administered to a host (e.g., a mammal). Human therapy is thereby contemplated. 

It is not intended that the present invention be limited by the nature of the 
administration of said minichromosomes. All types of administration are contemplated, 
including direct injection (intra-muscular. intravenous, subcutaneous, etc.). inhalation, etc. 

As noted above, the present invention contemplates a second approaches to improving 
adenovirus vectors. In the second approach, "damaged" adenoviruses are employed. In one 
embodiment, the present invention contemplates a recombinant adenovirus comprising the 
adenovirus E2b region having a deletion, said adenovirus capable of self-propagation in a 
packaging cell line and said E2b region comprising the DNA polymerase gene and the 
adenovirus preterminal protein gene. In this embodiment, said deletion can be within the 
adenovirus DNA polymerase gene. Alternatively, said deletion is within the adenovirus 
preterminal protein gene. Finally, the present invention also contemplates embodiments 
wherein said deletion is within the adenovirus DNA polymerase and preterminal protein 
uenes. 

The present invention further provides cell lines capable of supporting the propagation 
of Ad virus containing deletions within the E2b region. In one embodiment the invention 
provides a mammalian cell line stably and constitutively expressing the adenovirus El gene 
products and the adenovirus DNA polymerase. In one embodiment, these cell lines comprise 
a recombinant adenovirus comprising a deletion within the E2b region, this E2b-deleted 
recombinant adenovirus being capable of self-propagation in the cell line. The present 
invention is not limited by the nature of the deletion within the E2b region. In one 
embodiment, the deletion is within the adenoviral DNA polymerase gene. 

The present invention provides cells lines stably expressing E 1 proteins and the 
adenoviral DNA polymerase, wherein the genome of the cell line contains a nucleotide 
sequence encoding adenovirus DNA polymerase operably linked to a heterologous promoter. 
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In a particularly preferred embodiment, the cell line is selected from the group consisting of 
the B-6. B-9. C-l. C-4. C-7. C-13. and C-14 cell lines. 

The present invention further provides cell lines which further constitutively express 
the adenovirus preterminal protein (pTP) gene product (in addition to El proteins and DNA 
polymerase). In one embodiment, these pTP-expressing cell lines comprise a recombinant 
adenovirus comprising a deletion within the E2b region, the recombinant adenovirus being 
capable of self-propagation in the pTP-expressing cell line. In a preferred embodiment, the 
deletion within the E2b region comprises a deletion within the adenoviral preterminal protein 
gene. In another preferred embodiment, the deletion within the E2b region comprises a 
deletion within the adenoviral (Ad) DNA polymerase and preterminal protein genes. 

In a preferred embodiment, the cell lines coexpressing pTP and Ad DNA polymerase, 
contain within their genome, a nucleotide sequence encoding adenovirus preterminal protein 
operably linked to a heterologous promoter. In the invention is not limited by the nature of 
the heterologous promoter chosen. The art knows well how to select a suitable heterologous 
promoter to achieve expression in the desired host cell {e.g.. 293 cells or derivative thereof). 
In a particularly preferred embodiment, the pTP- and Ad polymerase-expressing cell line is 
selected from the group consisting of the C-l, C-4. C-7, C-13, and C-14 cell lines. 

The present invention provides a method of producing infectious recombinant 
adenovirus particles containing an adenoviral genome containing a deletion within the E2b 
region, comprising: a) providing: i) a mammalian cell line stably and constitutively 
expressing the adenovirus El gene products and the adenovirus DNA polymerase: ii) a 
recombinant adenovirus comprising a deletion within the E2b region, the recombinant 
adenovirus being capable of self-propagation in said cell line: b) introducing the recombinant 
adenovirus into the cell line under conditions such that the recombinant adenovirus is 
propagated to form infectious recombinant adenovirus particles: and c) recovering the 
infectious recombinant adenovirus particles. In a preferred embodiment, the method further 
comprises d) purifying the recovered infectious recombinant adenovirus particles. In yet 
another preferred embodiment, the method further comprises e) administering the purified 
recombinant adenovirus particles to a host which is preferably a mammal and most preferably 
a human. 

In another preferred embodiment the mammalian cell line employed in the above 
method further constitutively expresses the adenovirus preterminal protein. 
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The present invention further provides a recombinant plasmid capable of replicating in 
a bacterial host comprising adenoviral E2b sequences, the E2b sequences containing a deletion 
within the polymerase gene, the deletion resulting in reduced polymerase activity. The 
present invention is not limited by the specific deletion employed to reduce polymerase 
activity. In a preferred embodiment, the deletion comprises a deletion of nucleotides 8772 to 
9385 in SEQ ID NO:4. In one preferred embodiment, the recombinant plasmid has the 
designation pApol. In another preferred embodiment, the recombinant plasmid has the 
designation pBHGHApol. 

The present invention also provides a recombinant plasmid capable of replicating in a 
bacterial host comprising adenoviral E2b sequences, the E2b sequences containing a deletion 
within the preterminal protein gene, the deletion resulting in the inability to express functional 
preterminal protein without disruption of the VA RNA genes. The present invention is not 
limited by the specific deletion employed to render the pTP inactive: any deletion within the 
pTP coding region which does not disrupt the ability to express the Ad VA RNA genes may 
be employed. In a preferred embodiment, the deletion comprises a deletion of nucleotides 
10.705 to 1 1,134 in SEQ ID NO:4. In one preferred embodiment, the recombinant plasmid 
has the designation pApTP. In another preferred embodiment, the recombinant plasmid has 
the designation pBHGHApTP. 

In a preferred embodiment, the recombinant plasmid containing a deletion with the 
pTP region further comprises a deletion within the polymerase gene, this deletion resulting in 
reduced (preferably absent) polymerase activity. The present invention is not limited by the 
specific deletion employed to inactivate the polymerase and pTP genes. In a preferred 
embodiment, the deletion comprises a deletion of nucleotides 8.773 to 9586 and 11,067 to 
12.513 in SEQ ID NO:4. In one preferred embodiment, the recombinant plasmid has the 
designation pAXBApolApTPVARNA+tl3. In another preferred embodiment, the recombinant 
plasmid has the designation pBHGl lApolApTPVARNA+tl3. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 A is a schematic representation of the Ad polymerase expression plasmid 
pRSV-pol indicating that the Ad2 DNA polymerase sequences are under the transcriptional 
control of the RSV-LTR/promoter element and are flanked on the 3* end by the SV-40 small 
t intron and SV-40 polyadenylation addition site. 
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Figure IB is a schematic representation of the expression plasmid pRSV-pTP 
indicating that the Ad2 preterminal protein sequences are under the transcriptional control of 
the RSV-LTR/promoter element and are flanked on the 3" end by the SV-40 small-t intron 
and SV-40 polyadenylation signals. 

Figure 2 is an ethidium bromide-stained gel depicting the presence of Ad pol DNA 
sequences in genomic DNA from LP-293 cells and several hygromycin-resistant cell lines. 
The -750 bp PCR products are indicated by the arrow. 

Figure 3 is an autoradiograph depicting the results of a viral replication- 
complementation assay analyzing the functional activity of the Ad polymerase protein 
expressed by LP-293 cells and several hygromycin resistant cell lines. The 8.010 bp Hindlll 
fragments analyzed by densitometry are indicated by an arrow. 

Figure 4 is an autoradiograph indicating that cell lines B-6 and C-7 contained a 
smaller and a larger species of Ad polymerase mRNA while LP-293 derived RNA had no 
detectable hybridization signal. The location of the two species of Ad polymerase mRNA are 
indicated relative to the 28S and 18S ribosomal RNAs. and the aberrant transcript expressed 
by the B-9 cell line is indicated by an arrow. 

Figure 5 is an autoradiograph showing which of the cell lines that received the 
preterminal protein expression plasmid indicated the presence of pRSV-pTP sequences (arrow 
labelled "pTP") and El sequences (arrow labelled "El"). 

Figure 6 is an autoradiograph indicating in which of the cell lines transcription of 
preterminal protein is occurring (arrow labelled "~ 3kb"). 

Figure 7 is an autoradiograph indicating that the expression of the Ad-polymerase 
could overcome the replication defect of H5swM00 at non-permissive temperatures. 

Figure 8 graphically depicts plaque titre for LP-293, B-6 and C-7 cell lines infected 
with wtAd5. H5ts36. or H5sub]QQ, and the results demonstrate that the C-7 cell line can be 
used as a packaging cell line to allow the high level growth of El. preterminal, and 
polymerase deleted Ad vectors. 

Figure 9 is an autoradiograph showing that the recombinant pol" virus is viable on 
pol-expressing 293 cells but not on 293 cells which demonstrates that recombinant Ad viruses 
containing the 612 bp deletion found within pApol lack the ability to express Ad polymerase. 

Figure 10 is a schematic representation of the structure of pAd5pdys wherein the two 
inverted adenovirus origins of replication are represented by a left and right inverted terminal 
repeat (LITR/RITR). PI and P2 represent location of probes used for Southern blot analysis. 
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Figure 1 1 graphically illustrates the total number of transducing adenovirus particles 
produced (output) per serial passage on 293 cells, total input virus of either the helper (hpAP) 
or Ad5pdys. and the total number of cells used in each infection. 

Figures I2A-B show Southern blot analyses of viral DNA from lysates 3. 6. 9 and 12, 
5 digested with the restriction enzymes BssHll. Nru\ and EcoKV. For the analyses, fragments 
from the C terminus of mus musculus dystrophin cDNA (A) or the N terminus of E coli p- 
galactosidase (B) were labeled with dCTP 32 and used as probes. 

Figures 13A-B show the physical separation of Ad5pdys from hpAP virions at the 
third (A) and final (B) stages of CsCl purification. 
10 Figure 14 graphically depicts the level of contamination of AdSpdys EAMs by hpAP 

virions obtained from the final stage of CsCl purification as measured by p-galactosidase and 
alkaline phosphatase expression. The ratio of the two types of virions - Ad5pdys EAMs 
(LacZ) or hpAP (AP) in each fraction is indicated in the lower graph. 

Figures 15A-B are western blot (immunoblot) analyses of protein extracts from mdx 
15 myoblasts and myotubes demonstrating the expression of P-galactosidase (A) and dystrophin 
(B) in cells infected with Ad5pdys EAMs. 

Figures 16A-C depict immunofluorescence of dystrophin expression in wild type 
MM 14 myotubes (A), uninfected mdx (B) and infected mdx myotubes (C). 

Figure 17 is a schematic representation of the MCK/lacZ constructs tested to determine 
20 what portion of the - 3.3 kb DNA fragment containing the enhancer/promoter of the MCK 
gene is capable of directing high levels of expression of linked genes in muscle cells. 

Figure 18 is a schematic representation of a GFP/p-gal reporter construct suitable for 
assaying the expression of Cre recombinase in mammalian cells. 

Figure 19 is a schematic representation of the recombination event between the loxP 
25 shuttle vector and the Ad5d/700l genome. 

DEFINITIONS 

To facilitate understanding of the invention, a number of terms are defined below. 

The term "gene" refers to a DNA sequence that comprises control and coding 
30 sequences necessary for the production of a polypeptide or precursor thereof. The polypeptide 
can be encoded by a full length coding sequence or by any portion of the coding sequence so 
long as the desired enzymatic activity is retained. The term "gene" encompasses both cDNA 
and genomic forms of a given gene. 
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The term "wild-type" refers to a gene or gene product which has the characteristics of 
that gene or gene product when isolated from a naturally occurring source. A wild-type gene 
is that which is most frequently observed in a population and is thus arbitrarily designated the 
"normal" or "wild-type" form of the gene. In contrast, the term "modified" or "mutant" refers 
to a gene or gene product which displays modifications in sequence and or functional 
properties (i.e.. altered characteristics) when compared to the wild-type gene or gene product. 
It is noted that naturally-occurring mutants can be isolated; these are identified by the fact that 
they have altered characteristics when compared to the wild-type gene or gene product. 

The term "oligonucleotide" as used herein is defined as a molecule comprised of two 
or more deoxyribonucleotides or ribonucleotides, usually more than three (3). and typically 
more than ten (10) and up to one hundred (100) or more (although preferably between twenty 
and thirty). The exact size will depend on many factors, which in turn depends on the 
ultimate function or use of the oligonucleotide. The oligonucleotide may be generated in any 
manner, including chemical synthesis, DNA replication, reverse transcription, or a 

combination thereof. 

As used herein, the term "regulatory element" refers to a genetic element which 
controls some aspect of the expression of nucleic acid sequences. For example, a promoter is 
a regulatory element which facilitates the initiation of transcription of an operably linked 
coding region. Other regulatory elements are splicing signals, polyadenylation signals. 

termination signals, etc. (defined infra). 

Transcriptional control signals in eucaryotes comprise "promoter" and "enhancer" 
elements. Promoters and enhancers consist of short arrays of DNA sequences that interact 
specifically with cellular proteins involved in transcription [Maniatis. T. et al. Science 
236:1237 (1987)]. Promoter and enhancer elements have been isolated from a variety of 
eukaryotic sources including genes in yeast, insect and mammalian cells and viruses 
(analogous control elements, i.e., promoters, are also found in procaryotes). The selection of 
a particular promoter and enhancer depends on what cell type is to be used to express the 
protein of interest. Some eukaryotic promoters and enhancers have a broad host range while 
others are functional in a limited subset of cell types [for review see Voss. S.D. et al. Trends 
Biochem. ScL 11:287 (1986) and Maniatis. T. et al. supra (1987)]. 

The term "recombinant DNA vector" as used herein refers to DNA sequences 
• containing a desired coding sequence and appropriate DNA sequences necessary for the 
expression of the operably linked coding sequence in a particular host organism (e.g. 
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mammal). DNA sequences necessary for expression in procaryotes include a promoter, 
optionally an operator sequence, a ribosome binding site and possibly other sequences. 
Eukaryotic cells are known to utilize promoters, polyadenlyation signals and enhancers. 

The terms "in operable combination", "in operable order" and "operably linked" as 
used herein refer to the linkage of nucleic acid sequences in such a manner that a nucleic acid 
molecule capable of directing the transcription of a given gene and/or the synthesis of a 
desired protein molecule is produced. The term also refers to the linkage of amino acid 
sequences in such a manner so that a functional protein is produced. 

The term "genetic cassette" as used herein refers to a fragment or segment of DNA 
containing a particular grouping of genetic elements. The cassette can be removed and 
inserted into a vector or plasmid as a single unit. A plasmid backbone refers to a piece of 
DNA containing at least plasmid origin of replication and a selectable marker gene (e.g., an 
antibiotic resistance gene) which allows for selection of bacterial hosts containing the plasmid: 
the plasmid backbone may also include a polylinker region to facilitate the insertion of 
genetic elements within the plasmid. When a particular plasmid is modified to contain non- 
plasmid elements (e.g.. insertion of Ad sequences and/or a eukaryotic gene of interest linked 
to a promoter element), the plasmid sequences are referred to as the plasmid backbone. 

Because mononucleotides are reacted to make oligonucleotides in a manner such that 
the 5' phosphate of one mononucleotide pentose ring is attached to the V oxygen of its 
neighbor in one direction via a phosphodiester linkage, an end of an oligonucleotide is 
referred to as the "5 ? end" if its 5 ? phosphate is not linked to the 3" oxygen of a 
mononucleotide pentose ring and as the "3 ? end" if its 3' oxygen is not linked to a 5 ? 
phosphate of a subsequent mononucleotide pentose ring. As used herein, a nucleic acid 
sequence, even if internal to a larger oligonucleotide, also may be said to have 5" and 3* ends. 

When two different, non-overlapping oligonucleotides anneal to different regions of 
the same linear complementary nucleic acid sequence, and the 3* end of one oligonucleotide 
points towards the 5 ? end of the other, the former may be called the "upstream" 
oligonucleotide and the latter the "downstream" oligonucleotide. 

The term "primer" refers to an oligonucleotide which is capable of acting as a point of 
initiation of synthesis when placed under conditions in which primer extension is initiated. 
An oligonucleotide "primer" may occur naturally, as in a purified restriction digest or may be 
produced synthetically. 
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A primer is selected to be "substantially" complementary to a strand of specific 
sequence of the template. A primer must be sufficiently complementary to hybridize with a 
template strand for primer elongation to occur. A primer sequence need not reflect the exact 
sequence of the template. For example, a non-complementary nucleotide fragment may be 
attached to the 5* end of the primer, with the remainder of the primer sequence being 
substantially complementary to the strand. Non-complementary bases or longer sequences can 
be interspersed into the primer, provided that the primer sequence has sufficient 
complementarity with the sequence of the template to hybridize and thereby form a template 
primer complex for synthesis of the extension product of the primer. 

"Hybridization" methods involve the annealing of a complementary sequence to the 
target nucleic acid (the sequence to be detected). The ability of two polymers of nucleic acid 
containing complementary sequences to find each other and anneal through base pairing 
interaction is a well-recognized phenomenon. The initial observations of the "hybridization" 
process by Marmur and Lane, Proc. Natl Acad. Sci. USA 46:453 (1960) and Doty et ai. 
Proc. Natl Acad ScL USA 46:461 (1960) have been followed by the refinement of this 
process into an essential tool of modern biology. 

The complement of a nucleic acid sequence as used herein refers to an oligonucleotide 
which, when aligned with the nucleic acid sequence such that the 5" end of one sequence is 
paired with the 3 ? end of the other, is in "antiparallel association." Certain bases not 
commonly found in natural nucleic acids may be included in the nucleic acids of the present 
invention and include, for example, inosine and 7-deazaguanine. Complementarity need not 
be perfect: stable duplexes may contain mismatched base pairs or unmatched bases. Those 
skilled in the art of nucleic acid technology can determine duplex stability empirically 
considering a number of variables including, for example, the length of the oligonucleotide, 
base composition and sequence of the oligonucleotide, ionic strength and incidence of 

mismatched base pairs. 

Stability of a nucleic acid duplex is measured by the melting temperature, or "T m ." 
The T m of a particular nucleic acid duplex under specified conditions is the temperature at 
which on average half of the base pairs have disassociated. The equation for calculating the 
T m of nucleic acids is well known in the art. 

The term "probe" as used herein refers to a labeled oligonucleotide which forms a 
duplex structure with a sequence in another nucleic acid, due to complementarity of at least 
one sequence in the probe with a sequence in the other nucleic acid. 
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The term "label" as used herein refers to any atom or molecule which can be used to 
provide a detectable (preferably quantifiable) signal, and which can be attached to a nucleic 
acid or protein. Labels may provide signals detectable by fluorescence, radioactivity 
colorimetry. gravimetry. X-ray diffraction or absorption, magnetism, enzymatic activity, and 
the like. 

The terms "nucleic acid substrate" and nucleic acid template" are used herein 
interchangeably and refer to a nucleic acid molecule which may comprise single- or double- 
stranded DNA or RNA. 

"Oligonucleotide primers matching or complementary to a gene sequence" refers to 
oligonucleotide primers capable of facilitating the template-dependent synthesis of single or 
double-stranded nucleic acids. Oligonucleotide primers matching or complementary to a gene 
sequence may be used in PCRs, RT-PCRs and the like. 

A "consensus gene sequence" refers to a gene sequence which is derived by 
comparison of two or more gene sequences and which describes the nucleotides most often 
present in a given segment of the genes; the consensus sequence is the canonical sequence. 

The term "polymorphic locus" is a locus present in a population which shows variation 
between members of the population (/.<?., the most common allele has a frequency of less than 
0.95). In contrast, a "monomorphic locus" is a genetic locus at little or no variations seen 
between members of the population (generally taken to be a locus at which the most common 
allele exceeds a frequency of 0.95 in the gene pool of the population). 

The term "microorganism" as used herein means an organism too small to be observed 
with the unaided eye and includes, but is not limited to bacteria, viruses, protozoans, fungi, 
and ciliates. 

The term "microbial gene sequences" refers to gene sequences derived from a 
microorganism. 

The term "bacteria" refers to any bacterial species including eubacterial and 
archaebacterial species. 

The term "virus" refers to obligate, ultramicroscopic. intracellular parasites incapable 
of autonomous replication (/.*. replication requires the use of the host cell's machinery). 
Adenoviruses, as noted above, are double-stranded DNA viruses. The left and right inverted 
terminal repeats (ITRs) are short elements located at the 5 ? and 3* termini of the linear Ad 
genome, respectively and are required for replication of the viral DNA. The left ITR is 
located between 1-130 bp in the Ad genome (also referred to as 0-0.5 mu). The right ITR is 
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located from -3.7500 bp to the end of the genome (also referred to as 99.5-100 mu). The two 
ITRs are inverted repeats of each other. For clarity, the left ITR or 5" end is sued define the 
5 - and 3" ends of the ITRs. The 5" end of the left ITR is located at the extreme 5" end of the 
linear adenoviral genome: picturing the left ITR (LITR) as an arrow extending from the 5" 
end of the genome, the tail of the 5* ITR is located at mu 0 and the head of the left ITR is 
located at - 0.5 mu (further the tail of the left ITR is referred to as the 5" end of the left ITR 
and the head of the left ITR is referred to as the 3' end of the left ITR). The tail of the right 
or 3* ITR is located at mu 100 and the head of the rigt ITR is located at ~ mu 99.5; the head 
of the right ITR is referred to as the 5' end of the right ITR and the tail of the right ITR is 
referred to as the 3' end of the right ITR (RITR). In the linear Ad genome, the ITRs face 
each other with the head of each ITR pointing inward toward the bulk of the genome. When 
arranged in a "tail to tail orientation" the tails of each ITR (which comprise the 5 ? end of the 
LITR and the 3' end of the RITR) are located in proximity to one another while the heads of 
each ITR are separated and face outward (see for example, the arrangement of the ITRs in the 
EAM shown in Figure 10 herein). The "adenovirus packaging sequence" refers to the 4* 
sequence which comprises five (AI-AV) packaging signals and is required for encapsidation 
of the mature linear genome: the packaging signals are located from -194 to 358 bp in the Ad 

genome (about 0.5-1.0 mu). 

The phrase "at least one adenovirus gene coding region" refers to a nucleotide 
sequence containing more than one adenovirus gene coding gene. A "helper adenovirus" or 
"helper virus" refers to an adenovirus which is replication-competent in a particular host cell 
(the host may provide Ad gene products such as El proteins), this replication-competent virus 
is used to supply in trans functions (e.g., proteins) which are lacking in a second replication- 
incompetent virus: the first replication-competent virus is said to "help" the second 
replication-incompetent virus thereby permitting the propagation of the second viral genome 
in the cell containing the helper and second viruses. 

The term "containing a deletion within the E2b region" refers to a deletion of at least 
one basepair (preferably more than one bp and preferably at least 100 and most preferably 
more than 300 bp) within the E2b region of the adenovirus genome. An E2b deletion is a 
deletion that prevents expression of at least one E2b gene product and encompasses deletions 
within exons encoding portions of E2b-specific proteins as well as deletions within promoter 
and leader sequences. 
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An "adenovirus minichromosome" refers to a linear molecule of DNA containing the 
Ad ITRs on each end which is generated from a plasmid containing the ITRs and one or more 
gene of interest. The term "encapsidated adenovirus minichromosome" or "EAM" refers to an 
adenovirus minichromosome which has been packaged or encapsidated into a viral particle: 
plasmids containing the Ad ITRs and the packaging signal are shown herein to produce 
EAMs. When used herein, "recovering" encapsidated adenovirus minichromosomes refers to 
the collection of EAMs from a cell containing an EAM plasmid and a helper virus; this cell 
will direct the encapsidation of the minichromosome to produce EAMs. The EAMs may be 
recovered from these cells by lysis of the cell (e.g., freeze-thawing) and pelleting of the cell 
debris to a cell extract as described in Example 1 (Ex. 1 describes the recovery of Ad virus 
from a cell, but the same technique is used to recover EAMs from a cell). "Purifying" such 
minichromosomes refers to the isolation of the recovered EAMs in a more concentrated form 
(relative to the cell lysate) on a density gradient as described in Example 7: purification of 
recovered EAMs permits the physical separation of the EAM from any helper virus (if 
1 5 present). 

The term "transfection" as used herein refers to the introduction of foreign DNA into 
eukaryotic cells. Transfection may be accomplished by a variety of means known to the art 
including calcium phosphate-DNA co-precipitation. DEAE-dextran-mediated transfection. 
polybrene-mediated transfection. electroporation, microinjection, liposome fusion, lipofection. 

20 protoplast fusion, retroviral infection, and biolistics. 

The term "stable transfection" or "stably transfected" refers to the introduction and 
integration of foreign DNA into the genome of the transfected cell. The term "stable 
transfectant" refers to a cell which has stably integrated foreign DNA into the genomic DNA. 
As used herein, the term "gene of interest" refers to a gene inserted into a vector or 

25 plasmid whose expression is desired in a host cell. Genes of interest include genes having 
therapeutic value as well as reporter genes. A variety of such genes are contemplated, 
including genes of interest encoding a protein which provides a therapeutic function (such as 
the dystrophin gene, which is capable of correcting the defect seen in the muscle of MD 
patients), the utrophin gene, the CFTR gene (capable of correcting the defect seen in cystitic 

30 fibrosis patients), etc.. 

The term "reporter gene" indicates a gene sequence that encodes a reporter molecule 
(including an enzyme). A "reporter molecule" is detectable in anv detection svstem 
including, but not limited to enzyme {e.g., ELISA. as well as enzyme-based histochemical 
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assays), fluorescent, radioactive, and luminescent systems. In one embodiment, the present 
invention contemplates the £. coli p-galactosidase gene (available from Pharmacia Biotech. 
Pistacataway. NJ). green fluorescent protein (GFP) (commercially available from Clontech. 
Palo Alto. CA). the human placental alkaline phosphatase gene, the chloramphenicol 
acetyltransferase (CAT) gene: other reporter genes are known to the art and may be 
employed. 

As used herein, the terms "nucleic acid molecule encoding." "DNA sequence 
encoding." and "DNA encoding" refer to the order or sequence of deoxyribonucleotides along 
a strand of deoxyribonucleic acid. The order of these deoxyribonucleotides determines the 
order of amino acids along the polypeptide (protein) chain. The DNA sequence thus codes 
for the amino acid sequence. 

DESCRIPTION OF THE INVENTION 

The present invention provides improved adenovirus vectors for the delivery of 
recombinant genes to cells in vitro and in vivo. As noted above, the present invention 
contemplates two approaches to improving adenovirus vectors. The first approach generally 
contemplates a recombinant plasmid containing the minimal region of the Ad genome 
required for replication and packaging (i.e.. the left and right ITR and the packaging or ¥ 
sequence) along with one or more genes of interest; this recombinant plasmid is packaged into 
an encapsidated adenovirus minichromosome (EAM) when grown in parallel with an El- 
deleted helper virus in a cell line expressing the El proteins (e.g.. 293 cells). The 
recombinant adenoviral minichromosome is preferentially packaged. To prevent the 
packaging of the helper virus, a helper virus containing loxP sequences flanking the ¥ 
sequence is employed in conjunction with 293 cells expressing Cre recombinase: Cre-loxP 
mediated recombination removes die packaging sequence form the helper genome thereby 
preventing packaging of the helper during the production of EAMs. In the second approach, 
"damaged" or "deleted" adenoviruses containing deletions within the E2b region are 
employed. While the "damaged" adenovirus is capable of self-propagation in a packaging cell 
line expressing the appropriate E2b protein(s), the E2b-deleted recombinant adenovirus are 
incapable of replicating and expressing late viral gene products outside of the packaging cell 
line. 

In one embodiment, the self-propagating recombinant adenoviruses containing 
deletions in the E2b region of the adenovirus genome. In another embodiment, "gutted" 
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viruses are contemplated: these viruses lack all viral coding regions. In addition, packaging 

cell lines co-expressing El and E2b gene products are provided which allow the production of 
infectious recombinant virus containing deletions in the El and E2b regions without the use 
of helper virus. 

The Description of the Invention is divided into the following sections: I. Self- 
Propagating Adenovirus Vectors: II. Packaging Cell Lines: and III. Encapsidated Adenoviral 
Minichromosomes. 

I. Self-Propagating Adenovirus Vectors 

Self-propagating adenovirus (Ad) vectors have been extensively utilized to deliver 
foreign genes to a great variety of cell types in vitro and in vivo. "Self-propagating viruses" 
are those which can be produced by transfection of a single piece of DNA (the recombinant 
viral genome) into a single packaging cell line to produce infectious virus; self-propagating 
viruses do not require the use of helper virus for propagation. 

Existing Ad vectors have been shown to be problematic in vivo. This is due in part 
because current or first generation Ad vectors are deleted for only the early region 1 (El) 
genes. These vectors are crippled in their ability to replicate normally without the trans- 
complementation of El functions provided by human 293 cells, a packaging cell line [ATCC 
CRL 1573; Graham et al. (1977) J. Gen. Virol. 36:59]. Unfortunately, with the use of high 
titres of El deleted vectors, and the fact that there are El -like factors present in many cell 
types. El deleted vectors can overcome the block to replication and express other viral gene 
products [Imperiale et al. (1984) Mol. Cell Biol. 4:867: Nevins (1981) Cell 26:213: and 
Gaynor and Berk (1983) Cell 33:683]. The expression of viral proteins in the infected target 
cells elicits a swift host immune response, that is largely T-cell mediated [Yang and Wilson 
(1995) J. Immunol. 155:2564 and Yang el al. (1994) Proc. Natl. Acad. Sci. USA 91:4407]. 
The transduced cells are subsequently eliminated, along with the transferred foreign gene. In 
immuno- incompetent animals. Ad delivered genes can be expressed for periods of up to one 
year [Yang et al. (1994). supra; Vincent et al. (1993) Nature Genetics 5:130: and Yang et al. 
(1995) Proc. Natl. Acad. Sci. USA 92:7257]. 

Another shortcoming of first generation Ad vectors is that a single recombination 
event between the genome of an Ad vector and the integrated El sequences present in 293 
• cells can generate replication competent Ad (RCA), which can readily contaminate viral 
stocks. 
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In order to further cripple viral protein expression, and also to decrease the frequency 
of generating RCA. the present invention provides Ad vectors containing deletions in the E2b 
region. Propagation of these E2b-deleted Ad vectors requires cell lines which express the 
deleted E2b gene products. The present invention provides such packaging cell lines and for 
the first time demonstrates that the E2b gene products. DNA polymerase and preterminal 
protein, can be constitutively expressed in 293 cells along with the El gene products. With 
every gene that can be constitutively expressed in 293 cells comes the opportunity to generate 
new versions of Ad vectors deleted for the respective genes. This has immediate benefits; 
increased carrying capacity, since the combined coding sequences of the polymerase and 
preterminal proteins that can be theoretically deleted approaches 4.6 kb and a decreased 
incidence of RCA generation, since two or more independent recombination events would be 
required to generate RCA. Therefore, the novel El. Ad polymerase and preterminal protein 
expressing cell lines of the present invention enable the propagation of Ad vectors with a 
carrying capacity approaching 1 3 kb. without the need for a contaminating helper virus 
[Mitani ei al. (1995) Proc. Natl. Acad. Sci. USA 92:3854]. In addition, when genes critical 
to the viral life cycle are deleted {e.g., the E2b genes), a further crippling of Ad to replicate 
and express other viral gene proteins occurs. This decreases immune recognition of virally 
infected cells, and allows for extended durations of foreign gene expression. The most 
important attribute of El, polymerase, and preterminal protein deleted vectors, however, is 
their inability to express the respective proteins, as well as a predicted lack of expression of 
most of the viral structural proteins. For example, the major late promoter (MLP) of Ad is 
responsible for transcription of the late structural proteins LI through L5 [Doerfler. In 
Adenovirus DNA. The Viral Genome and Its Expression (Martinus Nijhoff Publishing Boston. 
1986)]. Though the MLP is minimally active prior to Ad genome replication, the rest of the 
late genes get transcribed and translated from the MLP only after viral genome replication has 
occurred [Thomas and Mathews (1980) Cell 22:523]. This cis-dependent activation of late 
gene transcription is a feature of DNA viruses in general, such as in the growth of polyoma 
and SV-40. The polymerase and preterminal proteins are absolutely required for Ad 
replication (unlike the E4 or protein IX proteins) and thus their deletion is extremely 
detrimental to Ad vector late gene expression. 



- 18 - 



WO 98/17783 PCT/US97/19541 
II. Packaging Cell Lines Constitutive^ Expressing E2b Gene 

Products 

The present invention addresses the limitations of current or first generation Ad 
vectors by isolating novel 293 cell lines coexpressing critical viral gene functions. The 
present invention describes the isolation and characterization of 293 cell lines capable of 
constitutively expressing the Ad polymerase protein. In addition, the present invention 
describes the isolation of 293 cells which not only express the El and polymerase proteins, 
but also the Ad-preterminal protein. The isolation of cell lines coexpressing the El, Ad 
polymerase and preterminal proteins demonstrates that three genes critical to the life cycle of 
Ad can be constitutively coexpressed, without toxicity. 

In order to delete critical genes from self-propagating Ad vectors, the proteins encoded 
by the targeted genes have to first be coexpressed in 293 cells along with the E 1 proteins. 
Therefore, only those proteins which are non-toxic when coexpressed constitutively (or toxic 
proteins inducibly-expressed) can be utilized. Coexpression in 293 cells of the El and E4 
genes has been demonstrated (utilizing inducible, not constitutive, promoters) [Yeh et al. 
(1996) J. Virol. 70:559; Wang et al (1995) Gene Therapy 2:775; and Gorziglia et al (1996) 
J. Virol. 70:4173]. The El and protein IX genes (a virion structural protein) have been 
coexpressed [Caravokyri and Leppard (1995) J. Virol. 69:6627], and coexpression of the El, 
E4. and protein IX genes has also been described [Krougliak and Graham (1995) Hum. Gene 
Ther. 6:1575]. 

The present invention provides for the first time, cell lines coexpressing El and E2b 
gene products. The E2b region encodes the viral replication proteins which are absolutely 
required for Ad genome replication [Doerfler. supra and Pronk et al (1992) Chromosoma 
102:S39-S45]. The present invention provides 293 cells which constitutively express the 140 
kD Ad-polymerase. While other researchers have reported the isolation of 293 cells which 
express the Ad-preterminal protein utilizing an inducible promoter [Schaack et al (1995) J. 
Virol. 69:4079], the present invention is the first to demonstrate the high-level, constitutive 
coexpression of the El. polymerase, and preterminal proteins in 293 cells, without toxicity. 
These novel cell lines permit the propagation of novel Ad vectors deleted for the El, 
polymerase, and preterminal proteins. 
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III. Encapsidated Adenoviral Minichromosomes 

The present invention also provides encapsidated adenovirus minichromosome (EAM) 
consisting of an infectious encapsidated linear genome containing Ad origins of replication, 
packaging signal elements, a reporter gene (e.g.. a p-galactosidase reporter gene cassette) and 
a gene of interest (e.g.. a full length (14 kb) dystrophin cDNA regulated by a muscle specific 
enhancer/promoter). EAMs are generated by cotransfecting 293 cells with supercoiled 
plasmid DNA (e.g.. pAd5(idys) containing an embedded inverted origin of replication (and the 
remaining above elements) together with linear DNA from El -deleted virions expressing 
human placental alkaline phosphatase (hpAP) (a helper virus). All proteins necessary for the 
generation of EAMs are provided in trans from the hpAP virions and the two can be 
separated from each other on equilibrium CsCl gradients. These EAMs are useful for gene 
transfer to a variety of cell types both in vitro and in vivo. 

Adenovirus-mediated gene transfer to muscle is a promising technology for gene 
therapy of Duchenne muscular dystrophy (DMD). However, currently available recombinant 
adenovirus vectors have several limitations, including a limited cloning capacity of -8.5 kb. 
and the induction of a host immune response that leads to transient gene expression of 3 to 4 
weeks in immunocompetent animals. Gene therapy for DMD could benefit from the 
development of adenoviral vectors with an increased cloning capacity to accommodate a full 
length (-14 kb) dystrophin cDNA. This increased capacity should also accommodate gene 
regulatory elements to achieve expression of transduced genes in a tissue-specific manner. 
Additional vector modifications that eliminate adenoviral genes, expression of which is 
associated with development of a host immune response, might greatly increase long term 
expression of virally delivered genes in vivo. The constructed encapsidated adenovirus 
minichromosomes of the present invention are capable of delivering up to 35 kb of non-viral 
exogenous DNA. These minichromosomes are derived from bacterial plasmids containing 
two fused inverted adenovirus origins of replication embedded in a circular genome, the 
adenovirus packaging signals, a (igalactosidase reporter gene and a full length dystrophin 
cDNA regulated by a muscle specific enhancer/promoter. The encapsidated minichromosomes 
are propagated in vitro by rrans-complementation with a replication defective (E1+E3 deleted) 
helper virus. These minichromosomes can be propagated to high titer (>10 8 /ml) and purified 
on CsCl gradients due to their buoyancy difference relative to helper virus. These vectors are 
able to transduce myogenic cell cultures and express dystrophin in myotubes. These results 
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demonstrate that encapsidated adenovirus minichromosomes are useful for gene transfer to 
muscle and other tissues. 

The present invention further provides methods for modifying the above-described 
EAM system to enable the generation of high titer stocks of EAMs with minimal helper virus 
contamination. Preferably the EAM stocks contain helper virus representing less than 1%, 
preferably less than 0.1% and most preferably less than 0.01% (including 0.0%) of the final 
viral isolate. 

The amount of helper virus present in the EAM preparations is reduced in two ways. 
The first is by selectively controlling the relative packaging efficiency of the helper virus 
versus the EAM virus. The CreAoxP excision method is employed to remove the packaging 
signals from the helper virus thereby preventing the packaging of the helper virus used to 
provide in trans viral proteins for the encapsidation of the recombinant adenovirus 
minichromosomes. The second approach to reducing or eliminating helper virus in EAM 
stocks is the use of improve physical methods for separating EAM from helper virus. 

EXPERIMENTAL 

The following examples serve to illustrate certain preferred embodiments and aspects 
of the present invention and are not to be construed as limiting the scope thereof. 

In the experimental disclosure which follows, the following abbreviations apply: M 
(molar): mM (millimolar); |iM (micromolar): mol (moles); mmol (millimoles); jimol 
(micromoles); nmol (nanomoles): mu or m.u. (map unit); g (gravity); gm (grams); mg 
(milligrams): ^g (micrograms); pg (picograms); L (liters); ml (milliliters); \xl (microliters); cm 
(centimeters): mm (millimeters); ^m (micrometers); nm (nanometers); hr (hour): min 
(minute): msec (millisecond); T (degrees Centigrade); AMP (adenosine 5 '-monophosphate); 
cDNA (copy or complimentary DNA); DTT (dithiotheritol); ddH : 0 (double distilled water); 
dNTP (deoxyribonucleotide triphosphate); rNTP (ribonucleotide triphosphate); ddNTP 
(dideoxyribonucleotide triphosphate); bp (base pair); kb (kilo base pair); TLC (thin layer 
chromatography); tRNA (transfer RNA); nt (nucleotide); VRC (vanadyl ribonucleoside 
complex): RNase (ribonuclease); DNase (deoxyribonuclease); poly A (polyriboadenylic acid); 
PBS (phosphate buffered saline); OD (optical density); HEPES (N-[2- 
Hydroxyethyl]piperazine-N-[2-ethanesulfonic acid]): HBS (HEPES buffered saline); SDS 
(sodium dodecyl sulfate); Tris-HCl (tris[Hydroxymethyl]aminomethane-hydrochloride); rpm 
(revolutions per minute); ligation buffer (50 mM Tris-HCl, 10 mM MgCL 10 mM 
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dithiothreitol. 25 ug/ml bovine serum albumin, and 26 uM NAD+. and pH 7.8): EGTA 
(ethylene glycol-bis(|J-aminoethyl ether) N. N. N\ N'-tetraacetic acid): EDTA 
(ethvlenediaminetetracetic acid): ELISA (enzyme linked immunosorbant assay): LB (Luria- 
Bertani broth: 10 g tryptone. 5 g yeast extract, and 10 g NaCl per liter. pH adjusted to 7.5 
with IN NaOH); superbroth (12 g tryptone. 24 g yeast extract. 5 g glycerol. 3.8 g KH : P0 4 
and 12.5 g. K 2 HP0 4 per liter); DMEM (Dulbecco's modified Eagle's medium): ABI (Applied 
Biosvstems Inc.. Foster City, CA): Amersham (Amersham Corporation. Arlington Heights. 
IL): ATCC (American Type Culture Collection. Rockville, MY): Beckman (Beckman 
Instruments Inc.. Fullerton CA): BM (Boehringer Mannheim Biochemicals. Indianapolis. IN); 
Bio-101 (Bio-101. Vista. CA); BioRad (BioRad. Richmond, CA); Brinkmann (Brinkmann 
Instruments Inc. Wesbury. NY); BRL, Gibco BRL and Life Technologies (Bethesda Research 

Laboratories. Life Technologies Inc.. Gaithersburg. MD): CRI (Collaborative Research Inc. 

Bedford. MA): Eastman Kodak (Eastman Kodak Co.. Rochester. NY): Eppendorf (Eppendorf. 

Eppendorf North America. Inc.. Madison. WI): Falcon (Becton Dickenson Labware. Lincoln 

Park. NJ): IBI (International Biotechnologies. Inc., New Haven. CT): ICN (ICN Biomedicals. 

Inc.. Costa Mesa. CA); lnvitrogen (Invitrogen. San Diego, CA); New Brunswick (New 

Brunswick Scientific Co. Inc.. Edison. NJ): NEB (New England BioLabs Inc.. Beverly, MA); 

NEN (Du Pont NEN Products. Boston. MA); Pharmacia (Pharmacia LKB Gaithersburg, MD); 

Promega (Promega Corporation. Madison. Wl): Stratagene (Stratagene Cloning Systems, La 

Jolla. CA): UVP (UVP. Inc.. San Gabreil. CA); USB (United States Biochemical Corp.. 

Cleveland. OH); and Whatman (Whatman Lab. Products Inc. Clifton. NJ). 

Unless otherwise indicated, all restriction enzymes and DNA modifying enzymes were 
obtained from New England Biolabs (NEB) and used according to the manufacturers 



directions. 



EXAMPLE I 

Generation Of Packaging Cell Lines That Coexpress 
The Adenovirus El And DNA Polymerase Proteins 

In this example, packaging cell lines coexpressing Ad El and polymerase proteins 
were described. These cell lines were shown to support the replication and growth of H5ts36, 
an Ad with a temperature-sensitive mutation of the Ad polymerase protein. These 
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polymerase-expressing packaging cell lines can be used to prepare Ad vectors deleted for the 
El and polymerase functions. 



a) Tissue Culture And Virus Growth 

LP-293 cells (Microbix Biosystems. Toronto) were grown and serially passaged as 
suggested by the supplier. 

Plaque assays were performed in 60 mm dishes containing cell monolayers at ~ 90% 
confluency. The appropriate virus dilution in a 2% DMEM solution was dripped onto the 
cells, and the plates incubated at the appropriate temperature for one hour. The virus 
containing media was aspirated, the monolayer was overlaid with 10 mis of a pre-warmed 
EMEM agar overlay solution (0.8% Noble agar, 4% fetal calf serum, and antibiotics) and 
allowed to solidify. After the appropriate incubation time (usually 7 days for incubations at 
38.5°C and 10-12 days for incubations at 32°C), five mis of the agar-containing solution 
containing 1.3% neutral red was overlaid onto the infected dishes and plaques were counted 
the next day. An aliquot of the virus H5ts36 [Freimuth and Ginsberg (1986) Proc. Natl. 
Acad. Sci. USA 83:7816] was utilized to produce high titre stocks after infection of 293 cells 
at 32°C. H5ts36 is an Ad5-derived virus defective for viral replication at the nonpermissive 
temperature [Miller and Williams (1987) J. Virol. 61:3630]. 

The infected cells were harvested after the onset of extensive cytopathic effect, 
pelleted by centrifugation and resuspended in 10 mM Tris-Cl, pH 8.0. The lysate was freeze- 
thawed three times and centrifuged to remove the cell debris. The cleared lysate was applied 
to CsCK step gradients (heavy CsCl at density of 1.45 g/ml. the light CsCl at density of 1.20 
g/ml). ultracentrifuged. and purified using standard techniques [Graham and Prevec (1991) In 
Methods in Molecular Biology, Vol 7: Gene Transfer and Expression Protocols. Murray 
(ed.). Humana Press, Clifton, NJ, pp. 109-128]. The concentration of plaque forming units 
(pfu) of this stock was determined at 32°C as described above. Virion DNA was extracted 
from the high titre stock by pronase digestion, phenol-chloroform extraction, and ethanol 
precipitation. The leakiness of this stock was found to be <1 in 2000 pfu at the non- 
permissive temperature, consistent with previous reports [Miller and Williams (1987), supra]. 

b) Plasmids 

The expression plasmid pRSV-pol [Zhao and Padmanabhan (1988) Cell 55:1005] 
contains sequences encoding the Ad2 polymerase mRNA (including the start codon from the 
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exon at map unit 39) under the transcriptional control of the Rous Sarcoma Virus 
LTR/promoter element: the Ad2 DNA polymerase sequences are flanked on the 3* end by the 
SV-40 small t intron and SV-40 polyadenylation addition site (see Fig. 1A). Figure 1A 
provides a schematic representation of the Ad polymerase expression plasmid pRSV-pol. 
pRSV-pol includes the initiator methionine and amino-terminal peptides encoded by the exon 
at m.u. 39 of the Ad genome. The location of the PCR primers p602a and p2158c. the two 
Seal 1 kb probes utilized for Northern analyses, the polymerase terminator codon. and the 
polvadenvlation site of the IVa2 gene (at 11.2 m.u. of the Ad genome) are indicated. 

The expression plasmid pRS V-pTP [Zhao and Padmanabhan ( 1 988). supra] contains 
sequences encoding the Ad2 preterminal protein (including the amino terminal peptides 
encoded by the exon at map unit 39 of the Ad genome) under the transcriptional control of 
the Rous Sarcoma Virus LTR/promoter element; the pTP sequences are flanked on the 3' end 
by the SV-40 small-t intron and SV-40 polyadenylation signals (see Figure IB for a schematic 
of pRS V-PTP). Figure IB also shows the location of the EcoKV subfragment utilized as a 
probe in the genomic DNA and cellular RNA evaluations, as well as the initiator methionine 
codon present from map unit 39 in the Ad5 genome. The locations of the H5ml90 and 
H5«/A100 insertions are shown relative to the preterminal protein open-reading frame. The 
following abbreviations are used in Figure 1: ORF. open reading frame: small-t. small tumor 

antigen, and m.u.. map unit. 

P CEP4 is a plasmid containing a hygromycin expression cassette (Invitrogen). 

pFGMO (Microbix Biosystems Inc. Toronto, Ontario) is a plasmid containing 
sequences derived from Ad5<//309 which contain a deletion/substitution in the E3 region 
[Jones and Shek (1979) Cell 17:683]. P FG140 is infectious in single transfections of 293 
cells and is used as a control for transfection efficiency. 

c) Transfection Of 293 Cells 

LP-293 cells were cotransfected with BamHl linearized pRSV-pol and BamHl 
linearized pCEP4 at a molar ratio of 10:1 using a standard CaP0 4 precipitation method 
[Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual. Cold Spring Harbor 
Press. Plainview. NY. pp.16.33-16.36]. In addition. 293 cells were cotransfected with or with 
BamHl linearized pRSV-pol. BamHl linearized pRSV-pTP and pCEP using a molar ratio of 
' |0:1 (non-selectable:selectable plasmids). 
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Forty-eight hours after transfection. the cells were passaged into media containing 
hygromycin at 100 |ig/mL. Individual hygromycin resistant colonies were isolated and 
expanded. 



3 d) Isolation Of Ad Polymerase Expressing 293 Cell Lines 

Twenty hygromycin resistant cell lines were expanded and screened for the ability to 
express the Ad polymerase protein. Initially, the individual cell lines were assayed for the 
ability to support growth of the viral polymerase mutant H5ts36 using the plaque assay 
described in section a) above. It was speculated that constitutive expression of the wild type 

10 Ad polymerase protein in a clonal population of 293 cells should allow the growth of H5ts36 
at 38.5°C. However, it was unclear if constitutive expression of the Ad polymerase would be 
toxic when coexpressed with the El proteins present in 293 cells. Similar toxicity problems 
have been observed with the Ad ssDBP. and the pTP [Klessig ct ai (1984) Mol. Cell. Biol. 
4:1354 and Schaack el ai (1995) J. Virol. 69:4079]. 

15 Of the twenty hygromycin resistant cell lines isolated, seven were able to support 

plaque formation with H5ts36 at the non-permissive temperature, unlike the parental LP-293 
cells: these cell lines were named B-6. B-9, C-l. C-4, C-7. C-13 and C-14 (see Table 1) (the 
B-6 and B-9 cell lines received only the pRSV-pol and CEP4 plasmids: C-l. C-4. C-7, C-13 
and C-14 cell lines received the pRSV-pol, pRSV-pTP and CEP4 plasmids). For the results 

20 shown in Table 1 . dishes (60 mm) of near confluent cells of each cell line were infected with 
the same dilution of H5ts36 at the temperature indicated, overlaid with agar media, and 
stained for plaques as outlined in section a. Passage number refers to the number of serial 
passages after initial transfection with the plasmid pRSV-pol. 

The cell line B-6 produced plaques one day earlier than the other Ad polymerase- 

25 expressing cell lines, which may reflect increased polymerase expression (see below). Cell 
line B-9 demonstrated an increased doubling time whereas each of the other cell lines 
displayed no growth disadvantages relative to the parental LP-293 cells. As shown in Table 
1, even after multiple passages (in some instances up to four months of serial passaging) the 
cells were still capable of H5ts36 plaque formation at the non-permissive temperature, 

30 indicating that the RSV-LTR/promoter remained active for extended periods of time. 

However, the cell lines B-9 and C-13 displayed a decreased ability to plaque the virus at 32°C 
as well as at 38.5°C suggesting that a global viral complementation defect had occurred in 
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these cell lines after extended passaging. The remaining cell lines screened at later passages 
demonstrated no such defect, even after 20 passages {e.g.. cell line B-6. Table 1). 

TABLE 1 

Plaquing Ability Of H5ts36 At The Non-Permissive Temperature Utilizing 293-Ad Polymerase Expressing 

Cell Lines 



Cell Line 



293 



B-6 



B-9 



C-l 



C-4 



C-7 



C-I 



C-14 



Passage Number 



20 
5 



14 



13 



14 

5 



14 

5 



27 
3 



27 



Number Of Plaques At: 



32.0°C 
-500 
>500 



>500 
>500 



90 



>500 



>500 



>500 



>500 
>500 



►500 
>500 



120 
>500 



>500 



38.5°C 
_0 

>500 



>500 
>500 



18 



>500 



>500 



>500 



>500 
>500 



>500 
oOO 



4 

>500 



170 



e) Genomic Analysis Of Ad Polymerase-Expressing Cell Lines 

Genomic DNA from LP-293 cells and each of the seven cell lines able to complement 
H5ts36 at 38.5°C were analyzed by PCR for the presence of pRSV-pol derived sequences. 
Genomic DNA from LP-293 cells and the hygromycin resistant cell lines were harvested 
using standard protocols (Sambrook et aL supra) and 200 ng of DNA from each cell line was 
analvzed bv PCR in a solution containing 2 ng/mL of primers p602a and p2158c, 10 mM 
TrisHCl. pH 8.3. 50 mM KC1, 1.5 mM MgCL and 0.001% gelatin. The forward primer. 
P 602a [5-TTCATTTTATGTTTCAGGTTC AGGG-3' (SEQ ID NO:2)] is located in the SV- 
40 polyadenylation sequence. The reverse primer p2158c [5*- 

TTACCGCC ACACTCGC AGGG-3 ' (SEQ ID NO:3)] is Ad-sequence specific with the 5' 
nucleotide located at position 3394 of the Ad 5 genome [numbering according to Doerfler 
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(1986) Adenovirus DXA. The Viral Genome and Its Expression. Nijhoff. Boston. MA. pp. 1- 
95]. 

PCR was performed with a Perkin Elmer 9600 Thermocycler utilizing the following 
cycling parameters: initial denaturation at 94°C for 3 min. 3 cycles of denaturation at 94°C 
5 for 30 sec. annealing at 50°C for 30 sec. and extension at 72°C for 60 sec, followed by 

another 27 cycles with an increased annealing temperature at 56°C. with a final extension at 
72°C for 10 minutes. PCR products were separated on a 1.0 % agarose gel and visualized 
with ethidium bromide staining (Figure 2). A 1 kb ladder (Gibco-BRL) was used as a size 
marker, and the plasmid pRSV-pol was used as a positive control. The -750 bp PCR products 
10 are indicated by an arrow in Figure 2. 

As shown in Figure 2, all cell lines capable of H5ts36 plaque formation at 38.5°C 
contained the Ad pol DNA sequences, whereas the LP-293 cells did not vield anv 
amplification product with these primers. This result demonstrates that each of the selected 
cell lines stably co-integrated not only the hygromycin resistance plasmid pCEP4. but also 
15 pRSV-pol. 

f) Complementation Of The Replication Defect Of H5ts36 By 
Ad Polymerase Expressing Cell Lines 

The C to T transition at position 7623 of the H5ts36 genome alters the DNA binding 
20 affinity of the Ad polymerase protein, rendering it defective for viral replication at non- 
permissive temperatures [Chen el ai (1994) Virology 205:364; Miller and Williams (1987) J. 
Virol. 61:3630; and Wilkie et ai (1973) Virology 51:499]. To analyze the functional activity 
of the Ad polymerase protein expressed by each of the packaging cell lines, a viral 
replication-complementation assay was performed. LP-293 cells or the hygromycin resistant 
25 cell lines were seeded onto 60 mM dishes at densities of 2.5-3.0 x 10 6 per dish, infected with 
H5ts36 at a multiplicity of infection (MOI) of 10, and incubated for 24 hours at 38.5°C, or 48 
hours at 32°C. Total DNA was harvested from each plate, then 2 [ig of each sample were 
digested with HindllL separated on a 1.0% agarose gel. transferred to a nylon membrane, and 
hybridized with 32 P-labeled H5ts36 virion DNA. Densitometric analysis of the 8.010 bp 
30 HindlU fragment in each lane was performed on a phophoroimager (Molecular Dynamics) 
utilizing a gel image processing system (IP Lab Version 1.5, Sunnyvale. CA). 
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The resulting autoradiograph is shown in Figure 3. In Figure 3. the lane marked 
"Std." (standard) contains 1 ug of //mdlll-digested H5ts36 virion DNA. The 8.010 bp 
tfmdIII fragments analyzed by densitometry are indicated by an arrow. 

As shown in Figure 3. H5ts36 had a diminished ability to replicate in LP-293 cells at 
the non-permissive temperature. In contrast, all seven of the previously selected cell lines 
were able to support replication of H5ts36 virion DNA at 38.5°C to levels approaching those 

occurring in LP-293 cells at 32°C. 

A densitometric analysis of the amount of H5ts36 viral DNA replicated in each of the 
cell lines at permissive and nonpermissive temperatures is presented in Table 2 below. For 
this assav. the relative amounts of the 8.010 bp ffwdlll fragment were compared. The 
relative levels of H5ts36 virion DNA replication determined by densitometric analysis of the 
8.010 bp //wdlll fragment isolated from each of the cell line DNA samples. The surface area 
of the 8.010 bp fragment in 293 cells incubated at 38.5°C was designated as 1. and includes 
some replicated H5ts36 virion DNA. The numbers in each column represent the ratio 
between the densities of the 8.010 bp fragment isolated in the indicated cell line and the 
density of the same band present in LP-293 cells at 38.5°C. 

As shown in Table 2. the levels of replication at the permissive temperature were all 
within four-fold of each other, regardless of which cell line was analyzed, but at the non- 
permissive temperature LP-293 cells reveal the H5ts36 replication defect. The viral bands 
that were present in the LP-293 DNA sample at 38.C represented input virion DNA as well as 
low level replication of H5ts36 DNA. which is generated due to the leakiness of the ts 
mutation at the high MOI utilized in this experiment. The Ad polymerase-expressing cell 
lines were all found to be capable of augmenting 
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TABLE 2 

Densitometry Analysis Of H5ts36 Replication 



Cell Line 


Ratios Of 8,010 bp ////idlll Fragment Generated At: 


32.0° C 


38.5 °C 




42.6 


1.0 


D C 
D-O 


27.3 


57.2 


B-9 


69.0 


15.8 


C-1 


113.6 


34.1 


C-4 


100.2 


65.5 


C-7 


114.9 


75.0 


C-13 


43.1 


42.2 


C-14 


46.7 


27.8 



H5ts36 genome replication. Although one cell line (B-9) allowed H5ts36 replication to levels 
only 16 fold greater than LP-293 cells, this was the same cell line that was observed to 

15 display poor growth properties. Each of the remaining Ad polymerase-expressing cell lines 
allowed substantially greater replication of H5ts36 at non-permissive temperatures, compared 
to LP-293 cells (Table 2). An enhancement of replication up to 75 fold above that of LP-293 
cells was observed with the cell line C-7 at 38.5°C. A substantially more rapid onset of viral 
cytopathic effect in Ad polymerase-expressing cell lines was observed at either temperature. . 

20 These estimates of H5ts36 replication-complementation are conservative, since they have not 
been adjusted for the low level replication of H5ts36 at 38.5°C [Miller and Williams (1987). 
supra]. The leakiness of the H5ts36 mutation could potentially be overcome with the use of a 
virus deleted for the Ad polymerase gene. 

25 g) RNA Analysis Of Ad Polymerase-Expressing Cell Lines 

Total RNA was extracted from each of the cell lines using the RNAzol method 
(Teltest. Inc.. Friendswood. TX 77546; Chomczynski and Sacchi (1987) Anal. Biochem. 
162:156]. Fifteen micrograms of RNA from each cell line was electrophoresed on a 0.8% 
agarose-formaldehyde gel. and transferred to a Nytran membrane (Schleicher & Schuell) by 
30 blotting. The filter was UV crosslinked, and analyzed by probing with the two 32 P-labeled 1 
kb Seal subfragments of Ad which span positions 6095-8105 of the Ad5 genome (see Figure 
I A). These two Seal subfragments of the Ad genome are complimentary to the 5 ? end of the 
Ad polymerase mRNA. The resulting autoradiograph is shown in Figure 4. In Figure 4. the 
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location of the smaller and larger species of Ad polymerase mRNA are indicated relative to 
the 28S and 18S ribosomal RNAs. The aberrant transcript expressed by the B-9 cell line is 

indicated by an arrow. 

The results shown in Figure 4 revealed that the RNA derived from cell lines B-6 and 
C-7 contained two species of RNA. estimated to be -4800 and -7000 nt in length, while LP-293 
derived RNA had no detectable hybridization signal. The presence of two polymerase RNA 
species suggests that the polyadenylation signal of the Ad IVa2 gene (present in the Ad-pol 
construct, see Fig. 1A) is being utilized by the cell RNA processing machinery, in addition to 
the SV-40 polyadenylation signal. Similar analysis of RNA derived from the cell lines C-l. 
C-4. C-l 3. and C-l 4 also detected the same two transcripts as those detected in the RNA of 
cell lines B-6 and C-7. but at decreased levels, suggesting that even low levels of Ad 
polymerase mRNA expression can allow for the efficient replication of polymerase mutants 
such as H5ts36. The cell line B-6 expressed high levels of polymerase transcript and can 
plaque H5ts36 one day earlier than the other cell lines at 38.5°C. suggesting a causal 
relationship. It is interesting to note that the two polymerase transcripts are also detected in 
RNA isolated from cell line B-9, but substantial amounts of a larger RNA transcript (size > 
10 kb) is also present (see Figure 4). The high level production of the aberrant message may 
be related to the increased doubling time previously noted in this cell line. 

h) Transfectability Of Ad Polymerase-Expressing Cell Lines 

The ability of Ad polymerase-expressing 293 cell lines to support production of 
H5ts36 virions after transfection with H5ts36 genome DNA was examined as follows. 293 
cells as well as hygromycin resistant cell lines were grown to near confluency on 60 mm 
dishes and transfected with either 3 ug of purified H5ts36 virion DNA, or with 3.5 ug of the 
plasmid pFG140 (Microbix Biosystems). using the canonic lipid Lipofectamine (Gibco-BRL). 
Cells that received the H5ts36 virion DNA were incubated at 32°C for 14 days, or 38.5°C for 
10 days. The pFG140 transfected cells were incubated at 37.5°C for 10 days. All plates 
were then stained with the neutral red agar overlay and plaques were counted the next day. 
The results are shown in Table 3. 
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TABLE 3 



Cell line 


Number Of Plaques At: 


32.0°C 


38.5°C 


LP-293 


>500 


0 


B-6 


>500 




B-9 


n.d.' 


n.d. 


C-1 


n.d. 


>500 


C-4 


n.d. 


>500 


C-7 


n.d. 


>500 


C-I3 


n.d. 


100 


C-14 


n.d. 


>500 



10 



n.d. = not determined. 



15 The results shown in Table 3 demonstrated that transfection of H5ts36 DNA at the 

non-permissive temperature allows for ample plaque production in all of the Ad polymerase- 
expressing cell lines tested, unlike the parental LP-293 cells. Cell line C-1 3 was at passage 
number 29. and demonstrated a somewhat decreased ability to generate plaques at this 
extended passage number. These same cell lines are also capable of producing plaques when 

20 transfected with the plasmid pFG140, a piasmid capable of producing infectious. El 

dependent Ad upon transfection of the parental 293 cells [Ghosh-Choudhury (1986) Gene 
50:161]. These observations suggest that the Ad polymerase expressing cell lines should be 
useful for the production of second generation Ad vectors deleted not only for the El genes, 
but also for the polymerase gene. As shown below in Examples 2 and 3, this is indeed the 

25 case. 
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EXAMPLE 2 

Isolation And Characterization Of Packaging Cell Lines That 
Coexpress The Adenovirus EL DNA Polymerase And Preterminal Proteins 

In Example L packaging cell lines coexpressing Ad El and polymerase proteins were 
• described. These cell lines were shown to support the replication and growth of H5ts36, an 
Ad with a temperature-sensitive mutation of the Ad polymerase protein. These polymerase- 
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expressing packaging cell lines can be used to prepare Ad vectors deleted for the El and 
polymerase functions. In this example. 293 cells cotransfected with both Ad polymerase and 
preterminal protein expression plasmids are characterized. Cell lines co-expressing the Ad El. 
polymerase and preterminal proteins can be used to prepare Ad vectors deleted for the El. 
polymerase and preterminal protein (pTP) functions. 

a) Tissue Culture And Virus Propagation 

The use of LP-293 cells (Microbix Biosystems Inc.. Toronto). Ad-polymerase 
expressing cell lines, and plaquing efficiency assays of Ad viruses was conducted as described 
in Example 1. All cells were maintained in 10% fetal bovine serum supplemented DMEM 
media (GIBCO) in the presence of antibiotics. The virus H5*mM00 [Freimuth and Ginsberg 
(1986) Proc. Natl. Acad. Sci. USA 83:7816] has a temperature sensitive (ts) mutation caused 
by a three base pair insertion within the amino terminus of the preterminal protein, in addition 
to a deletion of the El sequences (see Figure IB). H5j«6100 was propagated and titred at 
32.0°C in LP-293 cells: the leakiness of this stock was less than 1 per 1000 plaque-forming 
units (pfu) at the nonpermissive temperature of 38.5°C. A lower titer cell lysate contaning 
the virus H5/nl90 (which contains a 12 base-pair insertion within the carboxy-terminus of the 
preterminal protein as well as a deletion of the El region, see Figure IB) was provided by Dr. 
P. Freimuth [Freimuth and Ginsberg (1986), supra]. The polymerase and preterminal protein 
expressing cell lines were always maintained in media supplemented with hygromycin 
(Sigma) at 100 ug/mL. 

b) Isolation Of Ad Polymerase And Preterminal Protein 
Expressing 293 Cells 

The C-l. C-4. C-7. C-13, and C-14 cell lines (Ex. 1), which had been cotransfected 
with pRSV-pol. pRSV-pTP and CEP4. were screened for presence of pTP sequences and for 
the ability to support the growth of H5ts36 (ts for the Ad-polymerase), H5ml90. and 
H5si/M00 using plaque assays as described in Example 1. 
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i) Analysis Of Genomic DNA And Cellular RNA 

Cell lines that had received the preterminal protein expression plasmid were screen for 
the presence of pRSV-pTP sequences and El sequences. Total DNA was isolated from the 
LP-293. B-6 (transfected with pRSV-pol only) or C-7 (cotransfected with pRSV-pol and 
pRSV-pTP) cell lines, two micrograms (ng) of each DNA was codigested with the restriction 
enzymes Xbal and BamHI. electrophoretically separated in a 0.6% agarose geL and 
transferred onto a nylon membrane. The membrane was UV crosslinked. probed with both a 
1.8 kb Blnl-Xbal fragment (spans the El coding region) isolated from the plasmid pFG140, 
and a 1.8kb EcoRV sub fragment of Ad serotype 5 (spans the preterminal protein coding 
sequences; see Fig. IB), both of which were random-primer radiolabeled with 32 P to a specific 
activity greater than 3.0 x 10 8 cpm/^g. The membrane was subsequently exposed to X-ray 
film with enhancement by a fluorescent screen. The resulting autoradiograph is shown in 
Figure 5. 

In Figure 5. the preterminal specific sequences migrated as an -1 1.0 kb DNA fragment 
while the El containing band migrated as a 2.3 kb DNA fragment. No hybridization of either 
probe to DNA isolated from either the LP-293 or B-6 cell lines was observed As shown in 
Figure 5. only the C-7 cell line genomic DNA had preterminal coding sequences, unlike the 
parental LP-293 cells, or the Ad-polymerase expressing B-6 cells. In addition, all cell lines 
had El specific sequences present at nearly equivalent amounts, demonstrating that the 
selection design has not caused the loss of the El sequences originally present in the LP-293 
cells. The results presented in Example 1 demonstrated that both the B-6 and C-7 cell lines 
contain polymerase specific sequences within their genomes, unlike the parental LP-293 cells. 

To confirm that transcription of preterminal protein was occurring, total RNA was 
isolated from each of the cell lines, transferred to nylon membranes, and probed to detect 
preterminal protein-specific mRNA transcripts as follows. Total cellular RNA was isolated 
from the respective cell lines and 15 \xg of total RNA from each cell line was transferred to 
nylon membranes. The membranes were probed with the 1.8 kb EcoRV radiolabeled 
subfragment of Ad5 (see Fig. 1 B) complementary to the preterminal protein coding region. 
The resulting autoradiograph is shown in Figure 6. 

As shown in Figure 6. a single mRNA of the expected size (~3 kb in length) is detected 
only in RNA derived from the C-7 cell line. No hybridization was detected in lanes 
containing RNA derived from the LP-293 or B-6 cell lines. In Example 1. it was 
demonstrated that the C-7 cell line also expresses high levels of the Ad polymerase mRNA. 
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Thus, the C-7 cell line constitutively expresses both the Ad polymerase and preterminal 
protein mRNAs along with El transcripts. 



ii) Plaquing Efficiency Of pTP Mutants On pTP- 

* 

Expressing Cell Lines 

The C-7 cell line was screened for the ability to transcomplement the growth of 
preterminal mutant viruses. The virus H5/M90 (contains a 12 base pair insertion located 
within the carboxy-terminus of the preterminal protein) has been shown to have a severe 
growth and replication defect, producing less than 10 plaque-forming units per cell [Freimuth 
and Ginsberg (1986) Proc. Natl. Acad. Sci. USA 83:7816]. The results are summarized in 
Table 4 below. For the results shown in Table 4, LP-293, B-6. or C-7 cells were seeded at a 
density of 2.0-2.5 x 10 6 cells per plate. The cells were infected with limiting dilutions of 
lysates derived from the preterminal protein-mutant viruses H5wl90 or H5jwM00. incubated 
at 38.5°C. and plaques counted after six days. As shown in Table 4. only the C-7 cell line 
could allow efficient plaque formation of H5/>il90 at 38.5°C (the H5wl90 lysate used to 
infect the cells was of a low titer, relative to the high titer H5sub\00 stock), while both the B- 
6 and C-7 cell lines had nearly equivalent plaquing efficiencies when H5.vwM00 was utilized 
as the infecting virus. 



TABLE 4 

Plaquing Efficiency Of Preterminal Protein- Mutant Viruses 



Virus 


[Mutation 
Location 


Plaque Titres (pfu/ml) 


LP-293 


B-6 


C-7 


H5//7I90 


carboxy-terminus 


<l x I0 : 


<l x 10 : 


1.4 x I0 5 


H5™6IOO 


amino-terminus 


<l x 10 4 


9.0 x 10 s 


4.5 x I0 H 



As shown in Table 4. when equivalent dilutions of H5wl90 were utilized, the plaquing 
efficiency of the C-7 cell line was at least 100-fold greater than that of the B-6 or LP-293 
cells. This result demonstrated that the C-7 cell line produces a functional preterminal 
protein, capable of trans-complementing the defect of the H5/>il90 derived preterminal 
protein. 

The cell lines were next screened for the ability to trans-complement with the 
temperature-sensitive virus, H5™M00. at nonpermissive temperatures. H5swM00 has a codon 
insertion mutation within the amino-terminus of the preterminal protein, as well as an El 
deletion. The mutation is responsible both for a temperature sensitive growth defect, as well 
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as a replication defect [Freimuth and Ginsberg (1986). supra and Schaack et ai (1995) J. 
Virol. 69:4079], The plaquing efficiency of the cell line C-7 was found to be at least 1000 
fold greater than that of the LP-293 cells (at non-permissive temperatures) (see Table 4). 
Interestingly, the cell line B-6 was also capable of producing large numbers of HSsi/MOO 
derived plaques at 38.5°C, even though it does not express any preterminal protein. This 
result suggested that the high level expression of the polymerase protein was allowing plaque 
formation of U5sub\00 in the B-6 cell line. To examine this possibility, the nature of 
HSsublOO growth in the various cell lines was examined. 

c) Complementation Of The Replication And Growth Defects 
Of HSmMOO 

The cell lines B-6 and C-7 were shown to overcome the replication defect of H5ts36 
(Ex. 1). Since the preterminal and polymerase proteins are known to physically interact with 
each other [Zhao and Padmanabhan (1988) Cell 55:1005], we investigated whether the 
expression of the Ad-polymerase could overcome the replication defect of HSsublOO at non- 
permissive temperatures using the following replication-complementation assay. 

LP-293. B-6. or C-7 cells were seeded onto 60 mM dishes at a density of 2xl0 6 cells 
per dish and infected the next day with HSsublOO at a multiplicity of infection (MOI) of 0.25, 
and incubated at 38.5°C for 16 hours or 32.0°C for 40 hours. The cells from each infected 
plate were then harvested and total DNA extracted as described in Example 1. Four 
micrograms of each DNA sample was digested with Hindlll electrophoresed through a 0.7% 
agarose gel. transferred to a nylon membrane, and probed with 32 P-labeled H5ts36 virion 
DNA. The resulting autoradiograph is shown in Figure 7. As seen in Figure 7. the 
HSsublOO replication defect when grown in LP-293 cells at 38.5°C is seen; this defect is not 
present when the virus is grown at the same temperature in either B-6 or C-7 cells. 

The results depicted in Figure 7 demonstrates that both cell lines B-6 and C-7 could 
trans-complement the replication defect of HSsublOO. This result demonstrated that the 
expression of the Ad polymerase in B-6 cells was able to overcome the preterminal protein- 
mediated replication defect of H5subl00. While not limiting the present invention to any 
particular mechanism, the ability of Ad polymerase to overcome the preterminal protein- 
mediated replication defect of H5*wM00 may be due to a direct physical interaction of the 
polymerase with the amino-terminus of the H5st/M00-derived preterminal protein. 
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In support of this hypothesis, it has also been demonstrated that the FbA«&100 
replication defect can be overcome when LP-293 cells were infected with a 100-fold greater 
amount of H5*iM00. However, complementation of the H5*«MO0 replication defect is not 
sufficient to overcome the growth defect of H5sm6100. since temperature shift-up experiments 
have demonstrated that the H5i»M00 growth defect is not dependent upon viral replication 
[Schaack et al. (1995). supra]. Therefore, the overexpression of the Ad-polymerase must 
have allowed a very low level but detectable production of infectious H5.v«M00 particles in 
the B-6 cell line. The reduced growth of HSsublW is therefore not due to a replication 
defect, but rather some other critical activity that the preterminal protein has a role in. such as 
augmention of viral transcription by association with the nuclear matrix [Schaack and Shenk 
(1989) Curr. Top. Microbiol. Immunol. 144:185 and Hauser and Chamberlain (1996) J. Endo. 
149:373]. This was confirmed by assessing the ability of the C-7 cell line to overcome the 
growth defect of H5sub\Q0 utilizing one-step growth assays performed as follows. 

Each of the cell lines (LP-293. B-6 and C-7) were seeded onto 60 mm dishes at 2.0 x 
10 s cells/dish. The cell lines were infected at an MOI of 4 with each of the appropriate 
viruses (wtAd5. H5ts36, or H5«/M00), and incubated at 38.5°C for 40 hours. The total 
amount of infectious virions produced in each 60 mm dish was released from the cell lysates 
bv three cvcles of freeze-thawing, and the titer was then determined by limiting dilution and 
plaque assay on B-6 cells at 38.5°C. The results are summarized in Figure 8. 

The results shown in Figure 8. demonstrated that even though the B-6 cell line allowed 
normal replication and plaque formation of H5*«M00 at 38.5°C (in fact. B-6 cells were 
utilized to determine the plaque litres depicted in Figure 8) they could not allow high level 
«rowth of HSraMOO and only produced litres of HfcwMOO equivalent to that produced by the 
LP-293 cells. The C-7 cell line produced 100 fold more virus than the LP-293 or B-6 cells, 
see Ficure 8. Encouragingly, the litre of H5s«6100 produced by the C-7 cells approached 
litres produced by LP-293 cells infected with wild-type virus. Ad5. When the H5™M00 
virions produced from infection of the C-7 cells were used to infect LP-293 cells at 38.5°C, 
all virus produced retained the a mutation {i.e.. at least a 1000 fold drop in pfu was detected 
when LP-293 cells were respectively infected at 38.5°C vs. 32.0°C). This finding effectively 
I rules out the theoretical possibility that the H5swM00 input virus genomes recombined with 
the preterminal protein sequences present in the C-7 cells. 

In addition, the C-7 cell line allowed the high level growth of H5ts36. demonstrating 
that adequate amounts of the Ad-polymerase protein were also being expressed. The C-7 cell 
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line was capable of trans-complementing the growth of both H5ts36 and HSsublOO after 4 
months of serial passaging, demonstrating that the coexpression of the El. preterminal, and 
polymerase proteins was not toxic. 

These results demonstrate that the constitutive expression of both the polymerase and 
preterminal proteins is not detrimental to normal virus production, which might have occurred 
if one or both of the proteins had to be expressed only during a narrow time period during the 
Ad life cycle. In summary, these results demonstrated that the C-7 cell line can be used as a 
packaging cell line to allow the high level growth of El, preterminal, and polymerase deleted 
Ad vectors. 



EXAMPLE 3 

Production Of Adenovirus Vectors 
Deleted For El And Polvmerase Functions 



In order to produce an Ad vectors deleted for El and polymerase functions, a small, 
frame-shifting deletion was introduced into the Ad-pol gene contained within an El -deleted 
Ad genome. The plasmid pBHGll (Microbix) was used as the source of an El -deleted Ad 
genome. pBHGll contains a deletion of Ad5 sequences from bp 188 to bp 1339 (0.5-3.7 
m.u.): this deletion removes the packaging signals as well as El sequences. pBHGl 1 also 
contains a large deletion within the E3 region (bp 27865 to bp 30995; 77.5-86.2 m.u.). The 
nucleotide sequence of pBHGll is listed in SEQ ID NO:4 [for cross-corrleation between the 
pBHGI 1 sequence and the Ad5 genome (SEQ ID NO;l), it is noted that nucleotide 8,773 in 
pBHGI 1 is equivalent to nucleotide 7,269 in Ad5]. 

pBHGl 1 was chosen to provide the Ad backbone because this plasmid contains a large 
deletion within the E3 region (77.5 to 86.2 m.u.) and therefore vectors derived from this 
plasmid permit the insertion of large pieces of foreign DNA. A large cloning capacity is 
important when the pol" vectors is to be used to transfer a large gene such as the dystrophin 
gene (cDNA = 13.6 kb). However, the majority of genes are not this large and therefore 
other Ad backbones containing smaller deletions within the E3 region (e.g., pBHGlO which 
contains a deletion between 78.3 to 85.8 m.u.; Microbix) may be employed for the 
construction of pol" vector's using the strategy outlined below. 
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a) Construction Of A Plasmid Containing A Portion Of The 
Adenovirus Genome Containing The Polymerase Gene 

A fragment of the Ad genome containing the pol gene located on pBHGll was 
subcloned to create pBSA-XB. Due to the large size of the Ad genome, this intermediate 
plasmid was constructed to facilitate the introduction of a deletion within the pol gene the pol 
deletion. pBSA-XB was constructed as follows. The polylinker region of pBluescript 
(Stratagene) was modified to include additional restriction enzyme recognition sites (the 
sequence of the modified polylinker is provided in SEQ ID NO:5; the remainder of 
pBluescnpt was not altered); the resulting plasmid was termed pBSX. pBHGll was digested 
with Xbal and BamHl and the 20.223 kb fragment containing the pol and pTP codtng regtons 
(E2b region) was inserted into pBSX digested with Xbal and BamHl to generate P BSA-XB. 

b) Construction Of pBHG 1 1 Apol 

A deletion was introduced into the pol coding region contained within pBSA-XB in 
such a manner that other key viral elements were not disturbed (e.g., the major late promoter, 
the tripartite leader sequences, the pTP gene and other leader sequences critical for normal 
virus viability). The deletion of the pol sequences was carried out as follows. P BSA-XB was 
digested with BspEl and the ends were filled in using T4 DNA polymerase. The BspEl- 
diuested T4 polymerase filled DNA was then digested with BamHl and the 8809 bp 
^HI/^EKfillcd) fragment was isolated as follows. The treated DNA was run on a 0.6% 
agarose gel (TAE buffer) and the 8809 bp fragment was excised from the gel and purified 
using a QIAEX Gel Extraction Kit according to the manufacturer's instructions (OIAGEN, 
Chatsworth. CA). 

A second aliquot of pBSA-XB DNA was digested with BspHl and the ends were filled 
in with T4 DNA polvmerase. The B S pHI-digested. T4 polymerase filled DNA was then 
digested with BamHl and the 13.679 bp BamHVBspHWW fragment was isolated as 
described above. 

The purified 8809 bp BamHVBspEWM) fragment and the purified 13.679 bp 
BammiBspWmOl fragment were ligated to generate pApol. pApol contains a 612 bp 
, deletion within the pol gene (bp 8772 to 9385; numbering relative to that of pBHGll) and 
lacks the 1 1 .4 kb BamHl fragment containing the right arm of the Ad genome found wUhtn 
• pBHGll. 
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To provide the right arm of the Ad genome. pApol was digested with BamHl followed 

by treatment with calf intestinal alkaline phosphatase. pBHGl 1 was digested with BamHl and 
the 11.4 kb fragment was isolated and purified using a QIAEX Gel Extraction Kit as 
described above. The purified 1 1.4 kb BamHl fragment was ligated to the 
flawHl/phosphatased pApol to generate pBHGl 1 Apol. Proper construction of pBHGl 1 Apol 
was confirmed restriction digestion (//mdlll). 

c) Rescue And Propagation Of AdSApol Virus 

The Ad genome contained within pBHGl lApol lacks the packaging signals. 
Therefore, in order to recover virus containing the 612 bp deletion within the pol gene from 
pBHGl 1 Apol. this plasmid must be cotransfected into packaging cells along with DNA that 
provides a source of the Ad packaging signals. The Ad packaging signals may be provided 
by wild-type or mutant Ad viral DNA or alternatively may be provided using a shuttle vector 
which contains the left-end Ad5 sequences including the packaging signals such as 
pAElsplA. pAElsplB (Microbix) or pAdBglll (pAdBglll is a standard shuttle vector which 
contains 0-1 m.u. and 9-16 m.u. of the adenovirus genome). 

To rescue virus. pBHGHApol was co-transfected with Ad5*//7001 viral DNA . 
Ad5c//700 1 contains a deletion in the E3 region; the E3 deletion contained within Ad5rf/700 1 
is smaller than the deletion contained within the Ad genome contained within pBHGl 1. It is 
not necessary that Ad5c//7001 be used to recover virus; other adenoviruses, including wild- 
type adenoviruses, may be used to rescue of virus from pBHGl lApol. 

It has been reported that the generation of recombinant Ads is more efficient if Ad 
DNA-terminal protein complex (TPC) is employed in conjunction with a plasmid containing 
the desired deletion [Miyake et al (1996) Proc. Natl. Acad. Sci. USA 93:1320]. Accordingly, 
Ad5dl7001-TPC were prepared as described [Miyake et al (1996). supra]. Briefly, purified 
Ad5 J/7001 virions (purified through an isopycnic CsCl gradient centered at 1.34 g/ml) were 
lysed by the addition of an equal volume of 8 M guanidine hydrochloride. The released 
Ad5c//7001 DNA-TPC was then purified through a buoyant density gradient of 2.8 M CsCl/4 
M guanidine hydrochloride by centrifugation for 16 hr at 55.000 rpm in a VTi65 rotor 
(Beckman). Gradient fractions containing Ad5t#7001 DNA-TPC were identified using an 
ethidium bromide spot test and then pooled, dialyzed extensively against TE buffer. BSA was 
then added to a final concentration of 0.5 mg/ml and aliquots were stored at -80°C. The 
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Ad5c//7001 DNA-TPC was then digested with Seal and then gel-filtered through a Sephadex 
G-50 spin column. 

One hundred nanograms of the digested Ad5c//7001 DNA-TPC was mixed with 5 ug 
of pBHGHApol and used to transfect pol-expressing 293 cells {i.e.. C-7). Approximately 10 
days post-transfection. plaques were picked. The recombinant viruses were plaque purified 
and propagated using standard techniques (Graham and Prevac. supra). Viral DNA was 
isolated, digested with restriction enzymes and subjected to Southern blotting analysis to 
determine the organization of the recovered viruses. Two forms of virus containing the 612 
bp pol deletion were recovered and termed Ad5ApolAE3I and Ad5ApolAE3II. One form of 
recombinant pol' virus recovered. Ad5ApolAE3I. underwent a double recombination event 
with the Ad5c//7001 sequences and contains the E3 deletion contained within Ad5d/7001 at 
the right end of the genome. The second form of recombinant pol' virus. Ad5ApolAE3II. 
retained pBHGl 1 sequences at the right end of the genome (/.c>. contained the E3 deletion 

found within pBHGl 1). These results demonstrate the production of a recombinant Ad vector 

containing a deletion within the pol gene. 

d) Characterization Of The E1+, pol- Viruses 

To demonstrate that the deletion contained within these two pol" viruses renders the 
virus incapable of producing functional polymerase. Ad5ApolAE3I was used to infect 293 
cells and pol-expressing 293 cells (B-6 and C-7 cell lines) and a viral replication- 
complementation assay was performed as described in Example le. Briefly. 293. B-6 and C-7 
cells were seeded onto 60 mm dishes at a density of 2 x 106 cells/dish and infected with 
R5sub\00 or Ad5ApolAE31 at an MOI of 0.25. The infected cells were then incubated at 
37°C or 38.5°C for 16 hours, or at 32.0°C for 40 hours. Cells from each infected plate were 
then harvested and total DNA was extracted. Four micrograms of each DNA sample was 
digested with /flndlll, electrophoresised through an agarose gel. transferred to a nylon 
membrane and probed with 3: P-labeled adenoviral DNA. The resulting autograph is shown in 
Figure 9. In Figure 9, each panel shows, from left to right, DNA extracted from 293, B-6 
and C-7 cells, respectively infected with either H5*wM00 or Ad5ApolAE3I (labeled 

AdSAPOL in Fig. 9). 

As shown in Figure 9. the recombinant pol" virus was found to be viable on pol- 
expressing 293 cells but not on 293 cells. These results demonstrates that recombinant Ad 
viruses containing the 612 bp deletion found within pApol lack the ability to express Ad 
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polymerase. These results also demonstrate that B-6 and C-7 cells efficiently complement the 
Apol found within Ad5ApolAE3I and Ad5ApolAE3II. In addition, these results show that 
replication of the ts pTP mutant H5sub\Q0 can be complemented by high level expression of 
the Ad polymerase with or without co-expression of pTP. 

Because the expression of early genes is required for the expression of the late gene 
products, the ability of recombinant viruses contaning the Apol deletion to direct the 
expression of late gene products was examined. 293 and C-7 cells were infected with 
Ad5ApolAE3I and 24 hours after infection cell extracts were prepared. The cell extracts were 
serially diluted and examined for the expression of the fiber protein (a late gene product) by 
immunoblot analysis. The immunoblot was performed as described in Example 3C with the 
exception that the primary antibody used was an anti-fiber antibody (FIBER-KNOB obtained 
from Robert Gerard. University of Texas Southwestern Medical School). The results of this 
immunoblotting analysis revealed that no fiber protein was detected from 293 cells infected 
with Ad5ApolAE3I (at any dilutuion of the cell extract). In contrast, even a 1:1000 dilution 
15 of cell extract prepared from C-7 cells infected with Ad5ApolAE3I produced a visible band 
on the immunoblot. Therefore, the pol deletion contained within Ad5ApolAE3I resulted in a 
greater than 1 000-fold decrease in fiber production. 

The above results demonstrate that polymerase gene sequences can be deleted from the 
virus and that the resulting deleted virus will only grow on cells producing Ad-polymerase in 
20 trans. Using the pol-expressing cell lines described herein (e.g.. B-6 and C-7). large 

quantities of the pol" viruses can be prepared. A dramatic shut-down in growth and late gene 
expression is seen when cells which do not express Ad polymerase are infected with the pol" 
viruses. 

25 e) Generation Of El", Pol" Ad Vectors 

Ad5c//7001 used above to recuse virus containing the polymerase deletion is an El - 
containing virus. The presence of El sequences on the recombinant pol" viruses is 
undesirable when the recombinant virus is to be used to transfer genes into the tissues of 
animals: the El region encodes the transforming genes and such viruses replicate extremely 
30 well in vivo leading to an immune response directed against cells infected with the El- 
containine virus. 

w 

El" viruses containing the above-described polymerase deletion are generated as 
follows. pBHGHApol is cotransfected into pol-expressing 293 cells (e.g.. B-6 or C-7) along 
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with a shuttle vector containing the left-end Ad5 sequences including the packaging signals. 
Suitable shuttle vectors include pAElsplA (Microbix). pAElsplB (Microbix) or pAdBglll. 
The «ene of interest is inserted into the polylinker region of the shuttle vector and this 
plasmid is then cotransfected into B-6 or C-7 cells along witlipBHGHApol to generate a 
recombinant El\ pol" Ad vector containing the gene of interest. 



EXAMPLE 4 

Production Of Adenovirus Vectors 
Deleted For El And Preterminal Protein Functions 



In order to produce an Ad vectors deleted for El and preterminal protein functions, a 
small deletion was introduced into the Ad preterminal protein (pTP) gene contained within an 
El-deleted Ad genome. The plasmid pBHGl 1 (Microbix) was used as the source of an El- 
deleted Ad genome to maximize the cloning capacity of the resulting pTP" vector. However, 
other Ad backbones containing smaller deletions within the E3 region (e.g.. pBHGlO which 
contains a deletion between 78.3 to 85.8 m.u.; Microbix) may be employed for the 
construction of pTP~ vectors using the strategy outlined below. 

a) Construction Of pApTP 

A deletion was introduced into the pTP coding region contained within pBSA-XB (Ex. 
3) in such a manner that other key viral elements were not disturbed (e.g.. the tripartite leader 
sequences, the i-leader sequences, the VA-RNA I and II genes, the 55 kD gene and the pol 
gene). The deletion of the pTP sequences was carried out as follows. pBSA-XB was 
digested with Xbal and EcoRV and the 7.875 kb fragment was isolated as described (Ex. 3). 
Another aliquot of pBSA-XB was digested with Muni and the ends were filled in using T4 
DNA polymerase. The MwwI-digested, T4 polymerase filled DNA was then digested with 
Xbal and the 14.894 kb Xbal/Munl( filled) fragment was isolated as described (Ex. 3). The 
7.875 kb Muni fragment and the 14.894 kb XballMunli filled) fragment were ligated together 
to generate pApTP. 
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b) Construction Of pBHGllApTP 

pAppl contains a 429 bp deletion within the pTP gene (bp 10.705 to 1 1.134: 
numbering relative to that of pBHGl 1) and lacks the 1 1.4 kb BamHl fragment containing the 
right arm of the Ad genome found within pBHGll. 

To provide the right arm of the Ad genome. pApTP was digested with BamHl 
followed by treatment with shrimp alkaline phosphatase (SAP: U.S. Biochemicals. Cleveland, 
OH). pBHGl 1 was digested with BamHl and the 1 1.4 kb fragment was isolated and purified 
using a QIAEX Gel Extraction Kit as described (Ex. 3). The purified 1 1.4 kb BamHl 
fragment was ligated to the BamHI/phosphatased pApTP to generate pBHGl 1 ApTP. Proper 
construction of pBHGl 1 ApTP was confirmed restriction digestion. 

c) Rescue And Propagation Of Ad Vectors Containing The pTP 
Deletion 

The Ad genome contained within pBHGllApTP lacks the Ad packaging signals. 
Therefore, in order to recover virus containing the 612 bp deletion within the pol gene from 
pBHGl lApol, this plasmid must be cotransfected into packaging cells along with DNA that 
provides a source of the Ad packaging signals. The Ad packaging signals may be provided 
by wild-type or mutant Ad viral DNA or alternatively may be provided using a shuttle vector 
which contains the left-end Ad5 sequences including the packaging signals such as 
pAElsplA. pAElsplB (Microbix) or pAdBglll. 

Recombinant Ad vectors containing the pTP deletion which contain a deletion within 
the E3 region are generated by cotransfection of pBHGl lApTP (the gene of interest is 
inserted into the unique Pad site of pBHGl lApTP) with a E3-deleted Ad virus such as 
Ad5dl700 into pTP-expressing 293 cells (e.g., C-7); viral DNA-TPC are utilized as described 
above in Example 3. 

Recombinant vectors containing the pTP deletion which also contain deletions within 
the El and E3 regions are generated by cotransfection of pBHGllApTP into pTP-expressing 
293 cells (e.g.. C-7) along with a shuttle vector containing the left-end Ad5 sequences 
including the packaging signals. Suitable shuttle vectors include pAElsplA (Microbix), 
pAElsplB (Microbix) or pAdBglll. The gene of interest is inserted into the polylinker region 
of the shuttle vector and this plasmid is then cotransfected into B-6 or C-7 cells along with 
pBHGl lApTP to generate a recombinant El", pTP" Ad vector containing the gene of 
interest. 
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EXAMPLE 5 

Production Of Adenovirus Vectors Deleted For 
EL Polymerase And Preterminal Protein Functions 



In order to produce an Ad vectors deleted for El. polymerase and preterminal protein 
functions, a deletion encompassing pol and pTP gene sequences was introduced into the Ad 
sequences contained within an El -deleted Ad genome. The plasmid pBHGl 1AE4 was used 
as the source of an El -deleted Ad genome to maximize the cloning capacity of the resulting 
pol". pTP" vector. pBHGl 1AE4 is a modified form of BHG1 1 which contains a deletion of 
all E4 genes except for the E4 ORF 6; the E4 region was deleted to create more room for the 
insertion of a gene of interest and to further disable the virus. However, other EP Ad 
backbones, such as pBHGll and pBHGl (Microbix; pBHGlO contains a smaller deletion 
within the E3 region as compared to pBHGll), may be employed for the construction of 
pol". pTP" vectors using the strategy outlined below. 

Due to the complexity of the cloning steps required to introduce a 2.3 kb deletion that 
removes portions of both the pTP and pol genes, this deletion was generated using several 
steps as detailed below. 

a) Construction Of P AXBApolApTPVARNA+tl3 

In order to create a plasmid containing Ad sequences that have a deletion within the 
pol and pTP genes, pAXBApolApTPVARNA+tl3 was constructed as follows. 

pBS A-XB was digested with BspEl and the 1 8 kb fragment was isolated and 
recircularized to create pAXBApolApTP; this plasmid contains a deletion of the sequences 
contained between the BspEl sites located at 8.773 and 12.513 (numbering relative to 
5 pBHGll). 

A fragment encoding the VA-RNA3 sequence and the third leader of the tri-partite 
leader sequence was prepared using the PCR as follows. The PCR was carried out in a 
solution containing H5ts36 virion DNA (any Ad DNA, including wild-type Ad. may be used), 
2 ng/mL of primers 4005E and 4006E. 10 mM TrisHCl, pH 8.3. 50 mM KCL 1.5 mM 
0 MgCK. 0.001% gelatin and Pfu polymerase. The forward primer. 4005E, [5'- 

TGCCGCAGCACCGGATGCATCo* (SEQ ID NO:6)] contains sequences complementary to 
residues 12.551 to 12.571 of pBHGll (SEQ ID NO:4). The reverse primer. 4006E, [5'- 
GCG TCCGGA GGCTGCCATG CGGCAGGGo (SEQ ID NO:7)] is complementary to 
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residues 1 1.091 to 1 1.108 of pBHGl 1 (SEQ ID NO:4) as well as a BspEl site (underlined). 
The predicted sequence of the M.6 kb PCR product is listed in SEQ ID NO:8. 

PCR was performed with a Perkin Elmer 9600 Thermocycler utilizing the following 
cycling parameters: initial denaturation at 94°C for 3 min. 3 cycles of denaturation at 94°C 
for 30 sec. annealing at 50°C for 30 sec. and extension at 72°C for 60 sec. followed by 
another 27 cycles with an increased annealing temperature at 56°C. with a final extension at 
72°C for 10 minutes. The -1.6 kb PCR product was purified using a QIAEX Gel Extraction 
Kit as described (Ex. 3). 

The purified PCR fragment was then digested with BspEL pAXBApolApTP was 
digested with BspEl followed by treatment with SAP. The AspEI-digested PCR fragment and 
the fi.vpEI-SAP-treated pAXBApolApTP were ligated together to create 
pAXB Apol ApTP V ARN A+t 13. 



b) Construction Of pBHGllApolApTPVARNA+tl3 

pAXBApolApTPVARNA+tl3 contains a 2.3 kb deletion within the pol and pTP genes 
(bp 8.773 to 1 1,091; numbering relative to that of pBHGl 1) and lacks the 1 1.4 kb BamHl 
fragment containing the right arm of the Ad genome. 

To provide the right arm of the Ad genome, pAXBApolApTPVARNA+tl3 was 
digested with BamHl followed by treatment with SAP. pBHGl 1AE4 (pBHGl 1 or pBHGlO 
may be used in place of pBHGl 1AE4) was digested with BamHl and the 1 1.4 kb fragment 
was isolated and purified using a QIAEX Gel Extraction Kit as described (Ex. 3). The 
purified 1 1 .4 kb BamHl fragment was ligated to the flamHI/phosphatased 
pAXBApolApTPVARNA+tl3 to generate pBHGl 1 Apol ApTP V ARN A+t 13. Proper 
construction of pBHGl 1 Apol ApTP V ARN A+t 13 was confirmed restriction diuestion. 

c) Rescue And Propagation Of Ad Vectors Containing The pol, 
pTP Double Deletion 

The Ad genome contained within pBHGl 1 Apol ApTP V ARN A+t 13 lacks the Ad 
packaging signals. Therefore, in order to recover virus containing the 2.3 kb deletion within 
the pol and pTP genes from pBHGl 1 Apol ApTP V ARN A+t 13. this plasmid must be 
cotransfected into packaging cells along with DNA that provides a source of the Ad 
packaging signals. The Ad packaging signals may be provided by wild-type or mutant Ad 
viral DNA or alternatively may be provided using a shuttle vector which contains the left-end 
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Ad5 sequences including the packaging signals such as pAElsplA. pAElsplB (Microbix) or 
pAdBglll. 

Recombinant vAd vectors containing the pol, pTP double deletion which also contain a 
deletion within the E3 region are generated by cotransfection of 

pBHGl 1 ApolApTPVARNA+tl3 (the gene of interest is inserted into the unique Pad site of 
P BHGllApolApTPVARNA+tl3) with a E3-deleted Ad virus such as Ad5rf/7001 into pol- and 
pTP-expressing 293 cells (e.g.. C-7); viral DNA-TPC are utilized as described (Ex. 3). 

Recombinant vectors containing the pol, pTP double deletion which also contain 
deletions within the El and E3 regions are generated by cotransfection of 

pBHGl 1 ApolApTPVARNA+tl3 into pol- and pTP-expressing 293 cells (e.g.. C-7) along with 

a shuttle vector containing the left-end Ad5 sequences including the packaging signals. 

Suitable shuttle vectors include pAElsplA (Microbix), pAElsplB (Microbix) or pAdBglll. 

The gene of interest is inserted into the polylinker region of the shuttle vector and this 

plasmid is then cotransfected into B-6 or C-7 cells along with 

pBHGllApolApTPVARNA+tl3 to generate a recombinant El", pol ". pTP" Ad vector 
containing the gene of interest. 

EXAMPLE 6 

Encapsidated Adenovirus Minichromosomes 
Containing A Full Length Dystrophin cDNA 

In this Example, the construction of an encapsidated adenovirus minichromosome 
(EAM) consisting of an infectious encapsidated linear genome containing Ad origins of 
replication, packaging signal elements, a p-galactosidase reporter gene cassette and a full 
length (14 kb) dystrophin cDNA regulated by a muscle specific enhancer/promoter is 
described. EAMs are generated by cotransfecting 293 cells with supercoiled plasmid DNA 
( P Ad5pdys) containing an embedded inverted origin of replication (and the remaining above 
elements) together with linear DNA from El -deleted virions expressing human placental 
alkaline phosphatase (hpAP). All proteins necessary for the generation of EAMs are provided 
in trans from the hpAP virions and the two can be separated from each other on equilibrium 
CsCl aradients. These EAMs are useful for gene transfer to a variety of cell types both in 
vitro and in vivo. 
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a) Generation And Propagation Of Encapsidated Adenovirus 
Minichromosomes 

To establish a vector system capable of delivering full length dystrophin cDNA clones, 
the minimal region of Ad5 needed for replication and packaging was combined with a 
conventional plasmid carrying both dystrophin and a p-galactosidase reporter gene. In the 
first vector constructed, these elements were arranged such that the viral ITRs flanked (ITRs 
facing outward) the reporter gene and the dystrophin gene [i.e.. the vector contained from 5* 
to 3* the right or 3 ? ITR (mu 100 to 99). the dystrophin gene and the reporter gene and the 
left or 5" ITR and packaging sequnce (mu 1 to 0)]. Upon introduction of this vector along 
with helper virus into 293 cells, no encapsidated adenovirus minichromosomes were 
recovered. 

The second and successful vector, pAd5pdys (Fig. 10) contains 2.1 kb of adenovirus 
DNA. together with a 14 kb murine dystrophin cDNA under the control of the mouse muscle 
creatine kinase enhancer/promoter, as well as a P-galactosidase gene regulated by the human 
cytomegalovirus enhancer/promoter. 

Figure 10 shows the structure of pAdSpdys (27.8 kb). The two inverted adenovirus 
origins of replication are represented by a left and right inverted terminal repeat 
(LITR/RITR). Replication from these ITRs generates a linear genome whose termini 
correspond to the 0 and 1 00 map unit (mu) locations. Orientation of the origin with respect 
to wild type adenovirus serotype 5 sequences in mu is indicated above the figure (1 mu=360 
bp). Encapsidation of the mature linear genome is enabled by five (Al-AV) packaging signals 
(4*). The E. coli P-galactosidase and mus musculus dystrophin cDNAs are regulated by 
cytomegalovirus (CMV) and muscle creatine kinase (MCK) enhancer/promoter elements, 
respectively. Both expression cassettes contain the SV40 polyadenylation (pA) signal. Since 
the El A enhancer/promoter overlaps with the packaging signals, pAd5pdys was engineered 
such that RMA polymerase transcribing from the El A enhancer/promoter will encounter the 
SV40 late polyadenylation signal. Pertinent restriction sites used in constructing pAdSpdys 
are indicated below the figure. An adenovirus fragment corresponding to mu 6.97 to 7.77 
was introduced into pAdSpdys during the cloning procedure (described below). PI and P2 
represent location of probes used for Southern blot analysis. Restriction sites destroyed 
during the cloning of the Ad5 origin of replication and packaging signal are indicated in 
parentheses. 
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pAdSpdys was constructed as follows. pBSX (Ex. 3a) was used as the backbone for 
construction of P Ad5pdys. The inverted fused Ad5 origin of replication and five 
encapsidation signals were excised as a PsAIXbA fragment from pAdSori. a plasmid 
containing the 6 kb HmAWl fragment from pFG140 (Microbix Biosystems). This strategy also 
introduced a 290 bp fragment from Ad5 corresponding to map units 6.97 to 7.77 adjacent to 
the right inverted repeat (see Fig. 10). The E. coli p-galactosidase gene regulated by the 
human CMV immediate early (IE) promoter/enhancer expression cassette was derived as an 
EcoRVHindlll fragment from pCMVp [MacGregor and Caskey (1989) Nucleic Acids Res. 
17:2365]: the CMV IE enhancer/promoter is available from a number of suppliers as is the E. 
coli p-galactosidase gene]. The murine dystrophin expression cassette was derived as a 
BssHll fragment from pCVAA. and contains a 3.3 kb MCK promoter/enhancer element 
[Phelps et al. (1995) Hum. Mol. Genet. 4:1251 and Jaynes et al. (1986) Mol. Cell. Biol. 
6:2855]. The sequence of the -3.3 kb MCK promoter/enhancer element is provided in SEQ ID 
NO:9 (in SEQ ID NO:9. the last nucleotide of SEQ ID NO:9 corresponds to nucleotide +7 of 
the MCK gene). The enhancer element is contained within a 206 bp fragment located -1256 
and -1050 upstream of the transcription start site in the MCK gene. 

It was hoped that the P Ad5pdys plasmid would be packageable into an encapsidated 
minichromosome when grown in parallel with an El-deleted virus due to the inclusion of both 
inverted terminal repeats (ITR) and the major Ad packaging signals in the plasmid. The ITRs 
and packaging signals were derived from pFG140 (Microbix). a plasmid that generates El- 
defective Ad particles upon transfection of human 293 cells. 

hpAP is an El deleted Ad5 containing the human placental alkaline phosphatase gene 
[Muller et al. (1994) Circ. Res. 75:1039]. This virus was chosen to provide the helper 
functions so that it would be possible to monitor the titer of the helper virus throughout serial 
passages by quantitative alkaline phosphatase assays. 

293 cells were cotransfected with P Ad5pdys and hpAP DNA as follows. Low passage 
293 cells (Microbix Biosystems) were grown and passaged as suggested by the supplier. Five 
P Ad5pdys and hpAP DNA (5 and 0.5 ug, respectively) were dissolved in 70 ul of 20 mM 
HEPES buffer (pH 7.4) and incubated with 30 pi of DOTAP (BMB) for 15 min. at room 
temperature. This mixture was resuspended in 2 mis of DMEM supplemented with 2% fetal 
calf serum (FCS) and added dropwise to a 60 mm plate of 293 cells at 80% confluency. Four 
hours post-transfection the media was replaced by DMEM with 10% FCS. Cytopathic effect 
was observed 6-12 days post-transfection. 
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Cotransfection of 293 cells with supercoiled pAdSpdys and linear hpAP DNA 
produced AdSpdys EAMs (the encapsidated version of pAdipdys) approximately 6 to 12 days 
post transfection. as evidenced by the appearance of a cytopathic effect (CPE). Initially, the 
amount of AdSpdys EAMs produced was significantly lower than that of hpAP virions so the 
viral suspension was used to re-infect fresh cultures, from which virus was isolated and used 
for serial infection of additional cultures (Fig. 1 1). Infection and serial passaging of 293 cells 
was carried out as follows. 

Lysate from one 60 mm plate of transfected 293 cells was prepared by vigorously 
washing the cells from the plate and centrifuging at 1 K rpm in a clinical centrifuge. Cells 
were resuspended in DMEM and 2% FCS. freeze-thawed in a dry ice-ethanol bath, cell debris 
removed by centrifugation, and approximately 75% of the crude lysate was used to infect 293 
cells in DMEM supplemented with 2% FCS for 1 hour and then supplemented with 10% FCS 
thereafter. Infection was allowed to proceed for 18-20 hrs before harvesting the virus. The 
total number of cells infected in each serial passage is indicated in Figure 11. 

In Figure 11. the total number of transducing adenovirus particles produced (output) 
per serial passage on 293 cells, total input virus of either the helper (hpAP) or Ad5pdys, and 
the total number of cells used in each infection is presented. The total number of input/output 
transducing particles were determined by infection of 293 cells plated in 6-well microtiter 
plates. Twenty four hours post-infection the cells were assayed for alkaline phosphatase or P- 
galactosidase activity (described below) to determine the number of cells transduced with 
either Ad5pdys or hpAP. The number of transducing particles were estimated by 
extrapolation of the mean calculated from 31 randomly chosen 2.5 mm 2 sectors of a 961 mm : 
plate. The intra-sector differences in total output of each type of virus are presented as the 
standard deviations, a, in Figure 11. For each serial passage. 75% of the total output virus 
from the previous passage was used for infection. 

Alkaline phosphatase or P-galactosidase activity was determined as follows. For 
detection of alkaline phosphatase, infected 293 cells on Petri dishes were rinsed twice with 
phosphate buffered saline (PBS) and fixed for 10 minutes in 0.5% glutaraldehyde in PBS. 
Cells were again rinsed twice with PBS for ten minutes followed by inactivation of 
endogenous alkaline phosphatase activity at 65°C for 1 hr. in PBS prior to the addition of the 
chromogenic substrate BCIP (5-bromo-4-chloro-3-indolyl phosphate) at 0.15 mg/ml and nitro 
blue tetrazolium at 0.3 mg/ml) . Cells were incubated at 37°C in darkness for 3- 24 hrs. For 
P-galactosidase assays, the cells were fixed and washed as above, then assaved usine standard 
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methods [MacGregor et ai (1991) In Murray (ed.). Methods in Molecular Biology . Vol ?: 
Gene Transfer and Expression Protocols, Human Press Inc.. Clifton. NJ, pp. 217-225]. 

As shown in Fieure 1 1. the rate of increase in titer of Ad5pdys EAMs between 
transfection and serial passage 6 was approximately 100 times greater than that for hpAP 
virions. This result indicated that Ad5pdys has a replication advantage over the helper virus, 
probably due to the shorter genome length (a difference of approximately 8 kb) and hence an 

increased rate of packaging. 

Interestingly, after serial passage 6 there was a rapid decrease in the total titer of hpAP 
virions whereas the titer of AdSpdys EAMs continued to rise. While not limiting the present 
invention to any particular mechanism, at least two possible mechanisms could be responsible 
for this observation. Firstly, a buildup of defective hpAP virions due to infections at high 
multiplicities may slowly out-compete their full length counterparts, a phenomenon that has 
been previously observed upon serial propagation of adenovirus [Daniell (1976) J. Virol. 
19:685: Rosenwirth et ai (1974) Virology 60:431; and Buriingham et ai (1974) Virology 
60:419]. Secondly, the emergence of replication competent virions due to recombination 
events between El sequences in cellular DNA and the hpAP genome could lead to a buildup 
of virus particles defective in expressing alkaline phosphatase [Lochmuller at ai (1994) Hum. 
Gene Ther. 5:1485]. 

Southern analysis of DNA prepared from serial lysates 3. 6. 9 and 12 indicated that 
full length dystrophin sequences were present in each of these lysates (Fig. 12 A). In addition, 
the correct size restriction fragments were detected using both dystrophin and p-galactosidase 
probes against lysate DNA digested with several enzymes (Figs. 12A-B). 

Figure 12 shows a Southern blot analysis of viral DNA from lysates 3. 6. 9 and 12. 
digested with the restriction enzymes BssiilL Nrul and £cc?RV. indicating the presence of a 
full length dystrophin cDNA in all lysates. Fragments from the C terminus of mus musculus 
dystrophin cDNA (A) or the N terminus of E coli P-galactosidase (B) were labeled with 
dCTP 32 and used as probes [Sambrook et ai, supra]. The position of these probes and the 
predicted fragments for each digest is indicated in Figure 10. Note that one end of each 
fragment (except the 17.8 kb BssHll dystrophin fragment) detected is derived from the end of 
the linearized Ad5pdys genome (see Fig. 10). Low levels of shorter products, presumably 
derived from defective virions, become detectable only at high serial passage number. 

At the later passages (9 and 12) there appeared to be an emergence of truncated 
AdSpdys sequences, suggesting that deletions and/or rearrangements may be occurring at later 
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passages. Hence, most experiments were performed with AdSpdys EAMs derived from the 
earlier passages. The possibility of the emergence of replication competent (El -containing) 
viruses was also examined by infection of HeLa cells by purified and crude serial lysates. 
none of which produced any detectable CPE. 

b) Purification Of Encapsidated Adenovirus Minichromosomes 
On CsCl Gradients 

The ability to separate AdSpdys EAMs from hpAP virions based on their buoyancy 
difference (due presumably to their different genome lengths) on CsCl gradients was 
examined. Repeated fractionation of the viral lysate allows small differences in buoyancy to 
be resolved. CsCl purification of encapsidated adenovirus minichromosomes was performed 
as follows. Approximately 25% of the lysate prepared from various passages during serial 
infections was used to purify virions. Freeze-thawed lysate was centrifuged to remove the 
cell debris. The cleared lysate was extracted twice with 1,1.2 tricholorotrifluoroethane 
(Sigma) and applied to CsCl step and self forming gradients. 

Purification of virus was initially achieved by passing it twice through CsCl step 
gradients with densities of p=1.45 and p=1.20 in a SW28 rotor (Beckman). After isolation of 
the major band in the lower gradient, the virus was passed through a self forming gradient 
(initial p=1.334) at 37,000 rpm for 24 hrs followed by a relaxation of the gradient by 
reducing the speed to 10,000 rpm for 10 hrs. in a SW41 rotor (Beckman) at 12°C [Anet and 
Strayer (1969) Biochem. Biophys. Res. Commun. 34:328]. The upper band from the gradient 
(composed mainly of AdSpdys virions) was isolated using an 18 gauge needle, reloaded on a 
fourth CsCl gradient (p-1.334) and purified at 37,000 rpm for 24 hrs followed by 10.000 rpm 
for 10 hrs at 12°C. 

The AdSpdys-containing CsCl band was removed in 100 )il fractions from the top of 
the centrifugation tube and CsCl was removed by chromatography on Sephadex G-50 
(Pharmacia). Aliquots from each fraction were used to infect 293 cells followed by p- 
galactosidase and alkaline phosphatase assays to quantitate the level of contamination by 
hpAP virions in the final viral isolate. 

Results of the physical separation between AdSpdys EAMs and hpAP virions are 
shown in Figure 13. Figure 13 shows the physical separation of Ad5pdys from hpAP virions 
at the third (A) and final (B) stages of CsCl purification (initial p=1.334) in a SW41 tube. 
Aliquots of AdSpdys EAMs from the final stage were drawn through the top of the 



51 



PCT/US97/19541 

WO 98/17783 

centrifugation tube and assayed for p-galactosidase and alkaline phosphatase expression. 
Results of these assays are presented in Figure 1 4. 

In Figure 14. shows the level of contamination of AdSPdys EAMs by hpAP virions 
obtained from passage 6. Following four cycles of CsCl purification, aliquots were removed 
from the top of the centrifugation tube and used for infection of 293 cells, which were fixed 
24 hrs later and assayed for p-galactosidase or alkaline phosphatase activity (described below). 
The number of transducing particles are an underestimation of the actual totals as in some 
cases a positive cell may have been infected by more than one transducing particle. The ratio 
of the two types of virions - Ad5pdys EAMs (LacZ) or hpAP (AP) in each fraction is 

indicated in the lower graph. 

The maximum ratio of transducing AdSpdys EAMs to hpAP virions reproducibly 
achieved in this study was 24.8 -- a contamination with helper virus corresponding to 
approximately 4% of the final viral isolate. 

c) Dystrophin And /J-galactosidase Expression By Encapsidated 
Adenovirus Minichromosomes In Muscle Cells 

To determine if the Ad5pdys EAMs were able to express p-galactosidase and 
dystrophin in muscle cells, mouse mdx myogenic cultures were infected with CsCl purified 
EAMs. 

i) Propagation And Infection Of Muscle Cells 

MM 14 and mdx myogenic cell lines were kindly provided by S. Hauschka (University 
of Washington) and were cultured as previously described [Linkhart el al. (1981) Dev. Biol. 
86:19 and Clegg et al. (1987) J. Cell Biol. 105:949]. Briefly, myoblasts were grown on 
plastic tissue culture plates coated with a 0.1% gelatin in Ham's F-10 medium containing 15% 
(v/v) horse serum, 0.8 mM CaCU, 200 ng/ml recombinant human basic fibroblast growh 
factor (b-FGF) and 60 ug/ml genitimicin (proliferation medium). Cutures were induced to 
differentiate by switching to growth in the presence of growth medium lacking b-FGF and 
containing 10% horse serum (differentiation medium). Myoblasts or differentiated myotubes 
(three days post switching) were infected at a multiplicity of infection of 2.2 Ad5pdys EAMs 
per cell. Fractions containing minimal contamination with hpAP virions (3. 4 and 5 of 
• passage 6) were used for western and immunofluorescence analysis. Infection was allowed to 
proceed for 3 days for both the myoblasts and myotubes before harvesting cells. 
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ii) Total Protein Extraction And Immunoblot 
Analysis 

For protein extraction, muscle cells were briefly trypsinized. transferred to a 
microcentrifuge tube, centrifuged at 14 K for 3 min at room temp and resuspended two times 
5 in PBS. After an additional centrifugation. the cell pellet was resuspended in 80 )il of RIPA 
buffer (50 mM Tris-Cl, pH 7.5; 150 mM NaCl; 1% Nonidet P-40: 0.5% sodium 
deoxycholate: 0.1% SDS) (Sambrook et aL supra). The sample was briefly sheared using a 
22 gauge needle to reduce viscosity and total protein concentration assayed using the 
bicinconinic acid protein assay reagent (Pierce, Rockford, IL). Expression of full length 

10 dystrophin or P-galactosidase in infected mdx and MM 14 myoblasts or myotubes was 

analyzed by electrophoresis of 40 jig of total protein extract on a 6% SDS-PAGE gel (in 25 
mM Tris. 192 mM glycine. 10 mM P-mercaptoethanol. 0.1% SDS). After transferring to 
Gelman Biotrace NT membrane (in 25 mM Tris. 192 mM glycine. 10 mM p-mercaptoethanol. 
0.05% SDS, 20% methanol), the membrane was blocked with 5% non-fat milk and 1% goat- 

15 serum in Tris-buffered saline-Tween (TBS-T) for 12 hrs at 4°C. Immunostaining was done 
according to the protocol for the ECL western blotting detection reagents (Amersham Life 
Sciences. Buckingham. UK). The primary antibodies used were Dys-2 (Vector Laboratories) 
and anti-p-galactosidase (BMB. Indianapolis, IN) with a horseradish peroxidase-conjugated 
anti-mouse secondary antibody. 

20 Western blot analysis of EAM-infected mdx myoblasts and myotubes (three days post- 

fusion) indicated that EAMs were able to infect both of these cell types (Figure 15). In 
Figure 15. immunoblots of protein extracts from mdx myoblasts and myotubes demonstrating 
the expression of p-galactosidase (A) and dystrophin (B) in cells infected with AdSpdys 
EAMs. Total protein was extracted 3 days post infection in all cases. Myotubes were 

25 infected at three days following a switch to differentiation media. In Figure 15 A, lane 1 

contains total protein extract from 293 cells infected with a virus expressing P-galactosidase 
as a control (RSV-LacZ); lanes 2-5 contain total protein extracts from uninfected mdx 
myoblasts, mdx myoblasts infected with Ad5pdys EAMs, mdx myotubes and mdx myotubes 
derived from mdx myoblasts infected with Ad5pdys EAMs. respectively. In Figure 15B, 

30 lanes 1 and 7 contain total protein from mouse muscle ("C57") while lane 2 contains protein 
from wild type MM14 myotubes. as controls. Lanes 3-5 contain total protein extracts from 
uninfected mdx myoblasts, mdx myoblasts infected with AdSpdys EAMs, mdx myotubes and 
mdx myotubes derived from mdx myoblasts infected with AdSpdys EAMs. respectively. 
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As shown in Figure 15. expression of p-galactosidase was detected in both the infected 
mdx myoblasts and myotubes indicating that the CMV promoter was active at both these early 
staaes of differentiation in muscle cells. However, only infected mdx myotubes produced 
protein detected by dys-2. an antibody recognizing the 17 C-terminal amino acids of 
dystrophin (Fig. 1 5). No dystrophin expression was detected in infected myoblasts by western 
analysis, indicating that the muscle creatine kinase promoter functions minimally, if at all. 
within the Ad5Pdys EAM prior to terminal differentiation of these cells. 

Dystrophin expression in mdx cells infected with EAMs was confirmed by 
immunofluorescence studies using N-terminal dystrophin antibodies. In agreement with the 
western analysis, dystrophin expression from the MCK promoter was detected only in 
differentiated mdx myotubes infected by Ad5|Jdys EAMs (Fig. 16). Figures 16A-C show 
immunofluorescence of dystrophin in wild type MM 14 myotubes (A), uninfected mdx (B) and 
infected mdx myotubes (C). respectively. The results shown in Figure 16 demonstrate the 
transfer and expression of recombinant dystrophin to differentiated mdx cells by Ad5pdys 
EAMs. 

Immunofluorescence of myogenic cells was performed as follows. Approximately 1.5 
x 10" MM 14 or mdx myoblasts were plated on Poly-L-lysine (Sigma) coated glass slides (7 x 
3 cm) which had been previously etched with a 0.05% chromium potassium sulfate and 0.1% 
gelatin solution. For myotube analysis, the cultures were switched to differentiation media 
0 [Clegg et al. (1987), supra] 48 hours after plating, immediately infected and then allowed to 
fuse for 3 days, whereas myoblasts were continuously propagated in proliferation media 
[Clegg et al. (1987). supra]. Cells were washed three times with PBS at room temperature 
and fixed in 3.7% formaldehyde. For immunostaining. cells were incubated in 0.5% Triton 
X-100. blocked with 1% normal goat serum and incubated with an affinity purified antibody 
5 against the N-terminus of murine dystrophin for 2 hrs. followed by extensive washing in PBS 
and 0.1% Tween-20 with gentle shaking. Cells were incubated with a 1 :200 dilution of biotin 
conjugated anti-rabbit antibody (Pierce) for one hour and washed as above. Cells were 
further incubated with a 1:300 dilution of streptavidin-fluorescein isothiocyanate conjugate 
(Vectorlabs. Burlingam. CA) for one hour and washed as above, followed by extensive 

30 washing in PBS. 

The above results show that embedded inverted Ad origins of replication coupled to an 

encapsidation signal can convert circular DNA molecules to linear forms in the presence of 
helper virus and that these genomes can be efficiently encapsidated and propagated to high 
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titers. Such viruses can be purified on a CsCl gradient and maintain their ability to transduce 
cells in vitro, and their increased cloning capacity allows the inclusion of large genes and 
tissue specific gene regulatory elements. The above results also show that the dystrophin gene 
was expressed in cells transduced by such viruses and that the protein product was correctly 
localized to the cell membrane. The above method for preparing EAMs theoretically enables 
virtually any gene of interest to be inserted into an infectious minichromosome bv 
conventional cloning in plasmid vectors, followed by cotransfection with helper viral DNA in 
293 cells. This approach is useful for a variety of gene transfer studies in vitro. The 
observation that vectors completely lacking viral genes can be used to transfer a full-length 
dystrophin cDNA into myogenic cells indicates that this method may be used for the 
treatment of DMD using gene therapy. 

d) A Modified MCK Enhancer Increases Expression Of Linked 
Genes In Muscle 

The DNA fragment containing enhancer/promoter of the MCK gene utilized in the 
Ad5|Jdys EAM plasmid is quite large (-3.3. kb). In order to provide a smaller DNA fragment 
capable of directing high levels of expression of linked genes in muscle cells, portions of the 
3.3 kb MCK enhancer/promoter were deleted and/or modified and inserted in front of a 
reporter gene (/acZ). The enhancer element of the MCK gene was modified to produce the 
2RS5 enhancer: the sequence of the 2RS5 enhancer is provided in SEQ ID NO: 10. The first 
6 residues of SEQ ID NO: 10 represent a Kpnl site added for ease of manipulation of the 
modified MCK enhancer element. Residue number 7 of SEQ ID NO: 10 corresponds to 
residue number 2164 of the wild-type MCK enhancer sequence listed in SEQ ID NO:9 
(position 2164 of SEQ ID NO:9 corresponds to position -1256 of the MCK gene). Residue 
number 174 of SEQ ID NO: 10 corresponds to residue number 2266 of the wild-type MCK 
enhancer sequence listed in SEQ ID NO:9. 

These MCK/lacZ constructs were used to transfect cells in culture (/.«?.. myogenic 
cultures) or were injected as naked DNA into the muscle of mice and P-galactosidase activity 
was measured. Figure 1 7 provides a schematic of the MCK/lacZ constructs tested. The first 
construct shown in Figure 17 contains the 3.3. kb wild-type MCK enhancer/promoter 
fragment linked to the £ coli lacZ gene. The wild-type enhancer element (-1256 to -1056) 
is depicted by the box containing M E": the core promoter element (-358 to -80) is indicated by 
the light cross-hatching and the minimal promoter element (-80 to +7) is indicated by the dark 



55 



WO 98/17783 PCT/US97/19541 

cross-hatching. The core promoter element is required in addition to the minimal promoter 
element (which is required for basal expression) in order to achieve increased muscle-specific 
expression in tissue culture [Shield et ai (1996) Mol. Cell. Biol. 16:5058]. The modified 
enhancer element, the 2RS5 enhancer, is indicated by the box labeled "EV The box labeled 
"mi nx" contains a synthetic intron derived from adenovirus and is used to increase expression 
of the constructs (any intron may be utilized for this purpose). The box labeled "CMV" 
depicts the CMV IE enhancer/promoter which was used a positive control. 

In figure 17, p-galactosidase activity is expressed relative to the activity of the wild- 
type MCK enhancer/promoter construct shown at the top of the figure. As shown in Figure 
17, a construct containing the 2RS5 enhancer (SEQ ID NO: 10) linked to either the minimal 
MCK promoter (a - 261 bp element) or the core and minimal MCK promoter elements (a -539 
bp element) directs higher levels of expression of the reporter gene in muscle cells as 
compared to the -3.3 kb fragment containing the wild-type enhancer element. These modified 
enhancer/promoter elements are considerably smaller than the -3.3 kb fragment used in the 
AdSpdys EAM plasmid and are useful for directing the expression of foreign genes in muscle 
cells. These smaller elements are particularly useful for driving the expression of genes in the 
context of self-propagating adenoviral vectors which have more severe constraints on the 
amount of foreign DNA which can be inserted in comparison to the used of "gutted" 
adenoviruses such as the EAMs described above. 



EXAMPLE 7 

Generation Of High Titer Stocks Of Encapsidated Adenovirus 
Minichromosomes Containing Minimal Helper Virus Contamination 

The results presented in Example 6 demonstrated that encapsidated adenovirus 
minichromosomes (EAMs) can be prepared that lack all viral genes and which can express 
full-length dystrophin cDN As in a muscle specific manner. The propagation of these EAMs 
requires the presence of helper adenoviruses that contaminate the final EAM preparation with 
conventional adenoviruses (about 4% of the total preparation). In this example the EAM 
system is modified to enable the generation of high titer stocks of EAMs with minimal helper 
virus contamination. Preferably the EAM stocks contain helper virus representing less than 
1%. preferably less than 0.1% and most preferably less than 0.01% of the final viral isolate. 
Purified EAMs are then injected in vivo in muscles of dystrophin minus mdx mice to 
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determine whether these vectors lead to immune rejection and whether they can alleviate 
dystrophic symptoms in mdx muscles. 

The amount of helper virus present in the EAM preparations is reduced in two ways. 
The first is by selectively controlling the relative packaging efficiency of the helper virus 
versus the EAM virus. The second is to improve physical methods for separating EAM from 
helper virus. These approaches enable the generation of dystrophin-expressing EAMs that are 
contaminated with minimal levels of helper virus. 

« 

a) Development And Characterization Of Adenovirus Packaging 
Cell Lines Expressing The Cre Recombinase From 
Bacteriophage PI 

Cell lines expressing a range of Cre levels are used to optimize the amount of helper 
virus packaging that occurs during growth of the EAM vectors. The Cre-loxP system [Sauer 
and Henderson (1988) Proc. Natl. Acad. Sci. USA 85:5166] is employed to selectively disable 
helper virus packaging during growth of EAMs. The bacterial Cre recombinase catalyzes 
efficient recombination between a 34 bp target sequence called loxP. To delete a desired 
sequence. loxP sites are placed at each end of the sequence to be deleted in the same 
orientation: in the presence of Cre recombinase the intervening DNA segment is efficiently 
excised. 

Cell lines expressing a range of Cre recombinase levels are generated. The expression 
of too little Cre protein may result in high levels of helper virus being generated, which leads 
to unacceptably high levels of helper virus contaminating the final EAM preparation. If very 
high levels of Cre expression are present in a cell line, excision of the packaging signal from 
the helper virus would be 100% efficient (i.e., it would completely prevent helper virus 
packaging). As shown in Example 6, serial passage of EAM preparations containing low 
levels of helper virus increased the titer of the EAM. Therefore, it is desirable, at least in the 
initial passages of a serial passage that some helper virus capable of being packaged is 
present. A low level of packagable helper virus may be provided by using a cell line 
expressing levels of Cre recombinase which are not high enough to achieve excision of the 
packaging signals from 100% of the helper virus; these cell lines would be used early in the 
serial passaging of the EAM stock and a cell line expressing high enough levels of Cre 
recombinanse to completely prevent helper virus packaging would be used for the final 
passage. 
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Alternatively. EAMs may be prepared using a packaging cell line that supports high 
efficiency Cre recombinase-mediated excision of the packaging signals from the helper virus 

■ 

by transfection of the Cre-expressing cell line with the EAM plasmid followed by infection of 
these cells with /oxP-contaning helper virus using an MOI of > 1 .0. 

Human 293 cell lines that express a variety of levels of Cre recombinase are generated 
as follows. An expression vector containing the Cre coding region. pOG231. was 
cotransfected into 293 cells along with pcDNA3 (Invitrogen); pcDNA3 contains the neo gene 
under the control of the S V40 promoter. pOG23 1 uses the human CMV enhancer/promoter 
[derived from CDM8 (Invitrogen)] to express a modified Cre gene [pOG23 1 was obtained 
from S. Cf Gorman]. The modified Cre gene contained within pOG231 has had a nuclear 
localization signal inserted into the coding region to increase the efficiency of recombination. 

pOG231 was constructed as follows. A Bglll site was introduced into the 5 ? Xbal site 
of the synthethic intron of pMLSISCAT [Haung and Gorman (1990) Nucleic Acids Res. 
18:937] by linker tailing. A Bglll site in the synthethic intron of pMLSISCAT was destroyed 
and a Bamlil linker was inserted into the Pstl site at the 3" end of the synthethic intron in 
pMLSISCAT. Bglll and Smal sites and a nuclear localization signal were introduced into the 
y end of pMC-Cre [Gu et al (1993) Cell 73:1 155] using a PCR fragment that extended from 
the novel Bglll and Smal sites to the BamWl site in the Cre coding region. This PCR 
fragment was ligated to a BamWIISaR fragment containing a portion of the Cre coding region 
derived from pIC-Cre (Gu et aL supra) and the intron plus Cre coding sequence was inserted 
into a modified form of pOG44 [O'Gorman et al (1991) Science 251:1351] to generate 
pOG23 1 . The predicted sequence of pOG23 1 from the Bglll site to the fiawHI site located in 
the middle of the Cre coding sequence is listed in SEQ ID NO:l 1. 

One 60 mm dish of 293 cells (Microbix) were transfected with 10 |Xg of Pvull- 
linearized pOG23 1 and 1 ng of M>rf-linearized pcDNA3 using a standard calcium phosphate 
precipitation protocol. Two days after the addition of DNA, the transfected cells were split 
into three 100 mm dishes and 1000 iig/ml of active G418 was added to the medium. The 
cells were fed periodically with G418-contaning medium and three weeks later. 24 G418- 
resistant clones were isolated. 

The isolated clones were expanded for testing. Aliquots were frozen in liquid nitrogen 
at the earliest possible passage. The neomycin resistant cell lines were examined for the 
expression of Cre recombinase using following transfection assay. The neomycin resistant 
cells were transfected with PGK-l-GFP-lacZ (obtained from Sally Camper. Univ. of 
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Michigan. Ann Arbor. MI), which contains a green fluorescent protein (GFP) expression 
cassette that can be excised by Cre recombinase to allow expression of P-galactosidase: 
transfection was accomplised using standard calcium phosphate precipitation. Figure 1 8 
provides a schematic of a GFP/p-gal reporter construct suitable for assaying the expression of 
Cre recombinase in mammalian cells; GFP sequences and (3-gal (/.*.. lacZ) sequences are 
avaialbale from commmerical sources (e.g., Clonetech. Palo Alto. CA and Pharmacia Biotech. 
Piscataway. NJ. respectively). 

Control experiments verified that 293 cells transfected with PGK- 1 -GFP-lacZ 
expressed significant amounts of p-galactosidase only if these cells also expressed Cre 
recombinanse. p-galactosidase assays were performed as described in Example 6. Neomycin 
resistant cells expressing Cre recombinase were grouped as high, medium or low expressors 
based upon the amount of p-galactosidase activity produced (estimated by direct counting of 
P-galactosidase-positive cells per high-power field and by observing the level of staining and 
the rapidity with which the blue stain was apparent) when these cell lines were transfected 
15 with the GFP/p-gal reporter construct. Thirteen positive (i.e.. Oe-expressing) lines. D608#12. 
#7. #22. #18. #17, #4, #8, #2, #2/2, #13, #5. #15 and #21 were retained for further use. 

The results of this transfection analysis revealed that cultures of 293 cells expressing 
medium to high levels of Cre recombinase could be generated without apparent toxicity. 

20 b ) Generation Of Helper Adenovirus Strains That Contain loxP 

Sites Flanking The Adenovirus Packaging Signals 

Studies of EAM production demonstrated that the EAM vector has a packaging 
advantage over the helper adenovirus (Ex. 6). While not limiting the present invention to a 
particular mechanism, it is hypothesized that this packaging and replication advantage can be 

25 greatly increased by using helper viruses that approach the packaging size limits of Ad5 [Bett 
et al. (1993) J. Virol. 67:5911], by using viruses with mutations in E4 and/or E2 genes [Yeh 
el al. (1996) J. Virol. 70:559; Gorziglia et al. (1996) J. Virol. 70:4173; and Amalfitano et al. 
(1996) Proc. Natl. Acad. Sci. USA 93:3352], by inclusion of mutations or alterations in the 
packaging signals of the helper virus [Imler et al. (1995) Hum. Gene Ther. 6:711] and by 

30 combining these strategies. 

The Cre-loxP excision method is used to disable the packaging signals from the helper 
virus genomes. The Ad5 packaging domain extends from nucleotide 194 to 358 and is 
composed of five distinct elements that are functionally redundant [Hearing et al. (1987) J. 
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Virol. 61:2555]. Theoretically, any molecule containing the Ad5 origin of replication and 
packaging elements should replicate and be packaged into mature virions in the presence of 
non-defective helper virus. Disabling the packaging signals should allow replication and gene 
expression to proceed, but will prevent packaging of viral DNA into infectious particles. This 
in turn should allow the ratio of EAM to helper virus to be increased greatly. 

To disable the packaging signals within the helper virus used to encapsidate the EAMs, 
the loxP sequences are incorporated into a helper virus that has a genome approaching the 
maximal packaging size for Ad [-105 map units; Bett et al (1993). supra] which ftirther 
decreases the efficiency of helper virus packaging. Final virus size can be adjusted by the 
choice of introns inserted into the reporter gene, by the choice of reporter genes, or by 
including a variety of DNA fragments of various sizes to act as "stuffer" fragments. A 
convenient reporter gene is the alkaline phosphatase gene (see Ex. 6). The optimized Cre- 
loxP system is also incorporated into a helper viral backbone containing disruptions of any or 
all of the E1-E4 genes. Use of such deleted genomes requires viral growth on appropriate 
complementing cell lines, such as 293 cells expressing E2, and/or E4 gene products. The 
loxP sequences are incorporated into the helper virus by placing the loxP 
sequences on either side of the packaging signals on a shuttle vector. This modified shuttle 
vector is then used to recombine with the Ad DNA derived from a virus containing 
disruptions of any or all of the E1-E4 genes to produce the desired helper virus containing the 
packaging signals flanked by loxP sequences. 

pAdBglll, an adenovirus shuttle plasmid containing Ad sequences from 0-1 and 9-16 
map units (mu) was used as the starting material. Synthetic oligonucleotides were used to 
create a polylinker which was inserted at 1 mu within pAdBglll as follows. The Bglll LoxP 
oligo [5 " -G A AG ATCTAT A ACTTCGT ATAATGT ATGCT A 

TACGAAGTTATTACCGAAGAAATGGCTCGAGATCTTC-3* (SEQ ID NO:12) and its 
reverese complement [S'-GAAGATCTCGAGCCATTTCTTCGGTAATAACTTCGT 
ATAGCATACATTATACGAAGTTATAGATCTTC-3' (SEQ ID NO: 13)] were synthesized. 
The AflHI LoxP oligo [5 ' -CC AC ATGT AT A ACTTCGT AT AGC AT AC A 
TT ATACG A AGTTAT AC ATGTGG-3 ? (SEQ ID NO:14)] and its reverse complement [5 ? - 
CCACATGT AT AACTTCGT ATAATGT ATGCT AT ACGAAGTTATACATG TGG-3' (SEQ 

ID NO: 15)] were synthesized. The double stranded form of each loxP oligonuceotide was 
digested with the appropriate restriction enzyme (e.g., Bglll or Afllll) and inserted into 
pAdBglll which had been digested with Bglll and Afllll. This resulted in the insertion of 



-60- 



WO 98/17783 PCT7US97/19541 

LoxP sequences into the shuttle vector flanking the packaging signals that are located between 
0.5 and 1 mu of Ad5 (one mu equals 360 bp in the Ad5 genome: the sequence of the Ad5 
genome is Isited in SEQ ID NO:l). The 3" loxP sequence was inserted into the Bg/II site 
within pAdBglll. The 5* loxP sequence was inserted into the AfRW site located at base 143 
(-0.8 mu) (numbering relative to Ad5). 

DNA sequencing was used to verify the final structure of the modified shuttle plasmid 
between 0-1 mu. If Cre recombinase-mediated excision of the packaging signals is found to 
be too efficient (as judged by the production of too little helper virus), alternate sites for the 
insertion of the loxP sequences are used that would result in deletion of 2. 3 or 4 packaging 
sequences rather than all 5 [Grable and Hearing (1990) J. Virol. 64:2047]. The insertion of 
loxP sequences at sites along the Ad genome contained within the shuttle vector which would 
result in the deletion 2. 3 or 4 packaging sequences are easily made using the technique of 
recombinant PCR [Higuchi (1990) In: PCR Protocols: A Guide to Methods and 
Applications. Innis et ai (eds.) Academic Press. San Diego, CA, pp. 177-183]. The optimal 
amount of Cre recombinase-mediated excision of the packaging signals is that amount which 
permits the production of enough packaged helper virus to permit the slow spread of virus on 
the first plate of cells co-transfected with helper virus DNA (containing the loxP sequences) 
and the EAM vector. This permits serial passage of the EAM preparation onto a subsequent 
lawn of cells to increase the titer of the EAM preparation. Alternatively, if Cre recombinase- 
mediated excision of the packaging signals is essentially 100% efficient in all Cre-expressing 
cells lines (i.e.. regardless of the level of Oe-expression, that is even a cell line expressing a 
low level of Cre as judged by the GFP/p-gal assay described above), the packaged EAMs 
may be used along with helper virus (used at a MOI of -1.0) to infect the second or 
subsequent lawn of cells to permit serial passaging to increase the titer of the EAM 
preparation. 

Following introduction of the loxP sequences into the shuttle vector, the human 
placental alkaline phosphatase (HpAp) cDNA under control of the RSV promoter was inserted 
into the polylinker to provide a reporter gene for the helper virus. This is the same reporter 
used previously during EAM generation (Ex. 6). The HpAp sequences were inserted as 
follows. The /avP-containing shuttle vector was linearized with Xhol and the hpAp cassette 
was ligated into the Xhol site (a Xhol site was inserted into pAdBglll during the insertion of 
the loxP sequences as a Xlwl site was located on the 3 ? end of the loxP sequences inserted 
into the fl^/II site of pAdBglll). The HpAp cassette was constructed as follows. 
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pRSVhAPT40 (obtained from Gary Nabel. Univ. of Michigan. Ann Arbor. MI) was digested 
with EcoRI to generate an £coRI fragment containing the HpAP cDNA and the SV40 intron 
and polyadenylation sequences. pRc/RSV (Invitrogen) was digested with HindlU. then 
partially digested with EcoU. A 5.208 bp fragment was then size selected on an agarose gel. 
treated with calf alkaline phosphatase, and ligated to the fcoRI fragment derived from 
P RSVhAPT40 to generate pRc/RSVAP. pRc/RSVAP was then digested with Sail and Xhol to 
liberate the RSV promoter linked to the HpAP cDNA cassette (including the SV40 intron and 
polyadenylation sequences). This SaR-Xhol fragment was inserted into the /^-containing 
shuttle vector which had been digested with Sail and Xhol to generate pADLoxP-RSVAP. 

Helper virus containing loxP sites flanking the Ad packaging signals is generated by 
co-transfection of the LoxP shuttle plasmid (pADLoxP-RSVAP) and C/al-digested 
Ad5<//7001 DNA into 293 cells [Graham and Prevec (1991) Manipulation of Adenovirus 
Vectors. In Methods in Molecular Biology. Vol. 7: Gene Transfer and Expression Protocols. 
Murray (ed.). Humana Press Inc.. Clifton. NJ, pp. 109-128]. Figure 19 provides a schematic 
showing the recombination event between the loxP shuttle vector and the Ad5c//7001 genome. 
Co-transfection is carried out as described in Example 3c. 

Alternatively, the reporter gene may be inserted into the E3 region of pBHGlO or 
pBHGl 1 (using the unique Pad site) rather than into the polylinker located in the El region 
of the shuttle vector. The reporter gene-containing pBHGlO or 11 is then used in place of 
Ad5c//7001 for cotransfection of 293 cells along with the loxP-containing pAdBglll. 

Following cotransfection, recombinant plaques are picked, plaque purified, and tested 
for incorporation of both hpAp and loxP sequences by PCR and Southern analysis (Ex. 3). 
Viruses which contain loxP sites flanking the packaging signals and the marker gene (hpAp) 
are retained, propagated and purified. 

The isolated helper virus containing loxP sites flanking the packaging signals and the 
marker gene is then used to infect both 293 cells and the 293 cell lines that express Cre 
(section a. above). Cre recombinase-expressing cell lines that produce optimal levels of Cre 
recombinase when infected with the /oxP-containing helper virus are then used for the 
generation of EAMs as described below. 
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c) Generation Of Encapsidated Adenovirus Minichromosomes 
That Express Dystrophin 

The growth of EAMs is optimized using the following methods. In the first method, 
plasmid DNA from the EAM vector (Ex. 6) are co-transfected into 293 cells with purified 
viral DNA from the helper virus, and viruses are harvested 10-14 days later after appearance 
of a viral cytopathic effect (CPE) as described in Example 6. This approach is simple, yet 
potentially increases helper virus levels by allowing viral spread throughout the culture dishes. 

In the second method, the co-transfected 293 cells are overlaid with agar and single 
plaques are picked and tested for EAM activity by the ability to express P-galactosidase 
activity following infection of Hela cells (Ex. 6). This second approach is more time 
consuming, but will result in less contamination by helper. 

The ability of these two methods to generate EAMs are directly compared. The 
preferred method is that which produces the highest titer of EAM with the lowest 
contamination of helper virus. The efficiency of EAM generation may also be increased by 
using helper virus DNA that retains the terminal protein (TP) (/.<?.. the helper virus is used as 
a viral DNA-TPC as described in Ex. 3) The use of viral DNA-TPC has been shown to 
increase the efficiency of viral production following transfection of 293 cells by an order of 
magnitude [Miyake et ai (1996), supra]. 

The initial transfection will utilize 293 cells that do not express Cre recombinase. so 
that efficient spread of the helper can lead to large scale production of EAM. The method 
producing the highest ratio of EAM to helper virus is then employed to optimize conditions 
for the serial propagation of the EAM on 293 cells expressing Cre recombinase. This 
optimization is conducted using cell lines expressing different levels of Cre recombinase. The 
following variables are tested: 1) the ratio of input viral titer to cell number, 2) the number 
of serial passages to use for EAM generation, 3) the use of cell lines producing different 
levels of Cre to achieve the optimal ratio between high EAM titer and low helper titer, 4) 
continuous growth on Oe-producing 293 cells versus alternating between Oe-producing cells 
and the parental 293 cells, or to alternate between high and low Oe-producing cells and 5) 
CsCl purification of EAMs prior to re-infection of 293 cells increases the ratio of the final 
EAM/helper titers. The protocols that result in the highest yield of EAM with minimal helper 
virus are used to generate large volumes of crude viral lysates for purification by density 
gradient centrifugation (Ex. 6). 
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EAMs were purified from helper virus using standard CsCl density gradient 
centrifugation protocols in Example 6 and resulted in preparations containing -4% helper virus. 
In order to improve the physical separation of EAMs and helper virus, a variety of different 
centrifugation conditions are possible, including changing the gradient shape, type of rotor and 
tube used, combinations of step and continuous gradients, and the number of gradients used. 
Materials with better resolving powers than CsCl can also be employed. These include 
rubidium chloride and potassium bromide [Reich and Zarybnicky (1979) Annal. Biochem. 
94:193]. 

d) Use Of EAMs Encoding Dystrophin For Long Term 
Expression Of Dystrophin in Muscle 

To demonstrate the ability of the purified dystrophin expressing EAMs (prepared as 
described above) to deliver and express dystrophin in the muscle of an animal, the following 
experiment is conducted. First, purified dystrophin-EAMs are delivered to the muscle by 
direct intramuscular injection into newborn, 1 month, and adult (3 month) mouse quadriceps 
muscle. The dystrophin expressing EAM AdSpDys are injected into mdx mice and into 
transgenic mdx mice that express p-galactosidase in the pituitary gland [Tripathy et ai (1996) 
Nature Med. 2:545]. These latter mice are used to avoid potential immune-rejection of cells 
expressing p-galactosidase from the Ad5{iDys vector. An alternate EAM lacking the P- 
galactosidase reporter gene may also be employed; however, the presence of the p-Gal 
reporter simplifies EAM growth and purification (vectors lacking p-Gal have their purity 
estimated by PCR assays rather than by p-Gal assays). 

Following intramuscular injection of EAM. animals are sacrificed at intervals between 
1 week and 6 months to measure dystrophin expression [by western blot analysis and by 
immunofluorescence (Phelps et ai (1995) Hum. Mol. Genet. 4:1251 and Rafael et ai (1996) 
J. Cell Biol. 134:93] and muscle extracts will also be assayed for p-Gal activity [MacGregor 
et ai (1991), supra]. These results are compared with previous results obtained using current 
generation viral vectors {i.e.. containing deletions in El and E3 only) to demonstrate that 
EAMs improve the prospects for long term gene expression in muscle. 
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e) Use Of Dystrophin-EAMs To Prevent, Halt, Or Reverse The 
Dystrophic Symptoms That Develop In The Muscles Of mdx 
Mice 

To demonstrate that beneficial effects on dystrophic muscle are achieved by delivery 
5 of dystrophin expressing EAMs to mdx mice, the following experiments are conducted. 

Central nuclei counts are performed on soleus muscle at intervals following injection of EAM 
into newborn. 1 month, and 3 month (adult) mice (Phelps et al, 1995). Central nuclei arise 

* 

in mouse muscle only after a myofiber has undergone dystrophic necrosis followed by 
regeneration, and is a quantitative measure of the degree of dystrophy that has occurred in a 
10 muscle group (Phelps et a/., 1995). 

More informative assays are also contemplated. These assays require administration of 
EAMs to mouse diaphragm muscles. 

The diaphragm is severely affected in mdx mice (Stedman et aL 1991: Cox et a/., 
1993). and displays dramatic decreases in both force and power generation. Administration of 

15 virus to the diaphragm will allow the strength of the muscles to be measured at intervals 
following dystrophin delivery. The force and power generating assays developed by the 
Faulkner lab are used to measure the effect of dystrophin transgenes (Shrager et al, 1992; 
Rafael et al. 1994: Lynch et aL 1996). 

To detemine whether dystrophin delivery to dystrophic muscle reverses dystrophy or 

20 stabilizes muscle, varying amounts of the dystrophin EAM are delivered to the to diaphragm 
at different stages of the dystrophic process and then strength is measured at intervals 
following EAM administration. First, various titers of EAM are tested to determine the 
minimal amount of virus needed to transduce the majority of muscle fibers in the diaphragm. 
It has been shown that conventional adenovirus vectors can transduce the majority of 

8 

25 diaphragm fibers when 10 pfu are administrated by direct injection into the intraperitoneal 
cavity [Huard et al (1995) Gene Therapy 2:107]. In addition, it has been shown that 
transduction of a simple majority of fibers in a muscle group is sufficient to prevent virtually 
all the dystrophic symptoms in mice [Rafael et al (1994) Hum. Mol. Genet. 3:1725 and 
Phelps et al (1995) Hum. Mol. Genet. 4:1251]. Virus is administered to mdx animals at three 

30 different ages (neonatal, 1 month, and 3 months). Animals are sacrificed for physiological 
analysis of diaphragm muscle at two different times post infection ( 1 month and 3 months). 
Error control is achieved by performing these experiments in sextuplicate. Control animals 
consist of mock injected wild-type (C57B1/10) and dystrophic (mdx) mice. Three month old 
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mdx mice display a 40% reduction in force and power generation compared with wild-type 
mice, while 6 month animals display a greater than 50% reduction (Cox et ai (1993) Nature 
364:725: Rafael et ai (1994), supra\ Phelps et ai (1995). supra: and Corrado et ai (1996) J. 
Cell. Biol. 134:873]. 

EXAMPLE 8 

Improved Shuttle Vectors For The 
Production Of Helper Virus Containing LoxP Sites 



In the previous example, a shuttle vector containing adenoviral sequences extending 
from 9 mu to 16 mu of the Ad5 genome was modified to contain loxP sequences surrounding 
the packaging signals (the LoxP shuttle vector). This modified shuttle vector was then 
recombined with an Ad virus to produce a helper virus containing the loxP sequences. This 
helper virus was then used to infect cells expressing Cre recombinase along with DNA 
comprising a minichromosome containing the dystrophin gene, a reporter gene and the 
packaging signals and ITRs of Ad in order to preferentially package the minichromosomes. 
Using this approach the helper virus, which has had the packaging signals removed by Cre- 
loxP recombination, contains the majority of the Ad genome (only a portion of the E3 region 
is deleted). Thus, if low levels of helper virus are packaged and appear in the EAM 
preparation, the EAM preparation has the potential of passing on helper virus capable of 
directing the expression of Ad proteins in cells which are exposed to the EAM preparation. 
The expression of Ad proteins may lead to an immune response directed against the infected 
cells. 

Another approach to reducing the possibility that the EAM preparation contains helper 
virus capable of provoking an immune response is to use helper viruses containing deletions 
and/or mutations within the pol and pTP genes. Helper virus containing a deletion in the pol 
and/or pTP genes is cotransfected with the EAM construct into 293-derived cell lines 
expressing pol or pol and pTP to produce EAMs. Any helper virus present in the purified 
EAM preparation will be replication defective due to the deletion in the pol and/or pTP genes. 
0 As shown in Example 3, viruses containing a deletion in the pol gene are incapable of 

directing the expression of viral late genes: therefore, helper viruses containing a deletion in 
* the pol gene or the pol and pTP genes should not be capable of provoking an immune 
response {i.e.. a CTL response) against late viral proteins synthesized de novo. Shuttle 
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vectors contain deletions within the Ad pol and/or pTP genes are constructed as described 
below. 

a) Construction Of A Shuttle Vector Containing The Apol 
Deletion 

pAdBglll was modified to contain sequences corresponding to 9 to 40 mu of the Ad5 
genome as follows. pAdBglll was digested with Bglll and a linker/adapter containing an Ascl 
site was added to create pAdBglHAsc. pAdBglHAsc was then digested with Bstl 1071. 
Ad5aV7001 viral DNA was digested with Ascl and the ends were filled in using T4 DNA 
polymerase. The ^el-digested, T4 DNA polymerase filled Ad5d/7001 viral DNA was then 
digested with Bstl 1071 and the ^9.9 kb AscUBstl 107I(filled) fragment containing the pol and 
pTP genes was isolated (as described in Ex. 3) and ligated to the Bstl 1071-digested 
pAdBglHAsc to generate pAdAsc. pAdAsc is a shuttle vector which contains the genes 
encoding the DNA polymerase and preterminal protein (the inserted Ascl-Bstl 1071 fragment 
corresponds to nucleotides 5767 to 15. 671 of the Ad5 genome). 

A shuttle vector. pAdAscApol, which contains the 6 1 2 bp deletion in the pol gene 
(described in Ex. 3) was constructed as follows. Ad5ApolAE3I viral DNA (Ex. 3) was 
digested with Ascl and the ends were filled in using T4 DNA polymerase. The /isd-digested, 
T4 DNA polymerase filled Ad5ApolAE3I viral DNA was then digested with Atfll07I and the 
-9.3 kb Ascl-Bstl 107I(filled) fragment containing the deleted pol gene and the pTP gene was 
isolated {as described in Ex. 3) and ligated to Ay/1 1071-digested pAdBglHAsc to generate 
pAdAscApol. 

b) Construction Of A Shuttle Vector Containing The ApolApTP 
Deletion 

A shuttle vector, pAdAscApolApTP. which contains a 2.3 kb deletion within the pol 
and pTP genes (described in Ex. 5) was constructed as follows. 

pAXBApolApTPVARNA+tl3 (Ex. 5b) was digested with Ascl and the ends were filled in 
using T4 DNA polymerase. The ^.srl-digested. T4 DNA polymerase filled 
pAXBApolApTPVARNA+tl3 DNA was then digested with Bstl 1071 and the -7.6 kb AscU 
Bstl 107I(filIed) fragment containing the deleted pol gene and the pTP gene was isolated (as 
described in Ex. 3) and ligated to Bstl 1071-digested pAdBglHAsc to generate 
pAdAscApolApTP. 

In order to reduce the packaging of the above helper viruses, the pol" or pol", pTP" 
helper viruses can be modified to incorporate loxP sequences on either side of the packaging 
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signals as outlined in Example 7. The /oxP-containing pol" or pol". pTP" shuttle vectors 
(pTP- shuttle vectors may also be employed) are cotransfected into 293 cells expressing Cre 
recombinase and pol or Cre recombinase. pol and pTP. respectively along with an appropriate 
El" viral DNA-TPC (the El- viral DNA may also contain deletions elsewhere in the genome 
such as in the E4 genes or in the E2a gene as packaging cell lines expressing the E4 ORF 6 
or 6/7 and lines expressing E2a genes are avaialble) to generate helper virus containing loxP 
sites flanking the packaging signals as well as a deletion in the pol gene or the pol and pTP 
genes. The resulting helper virus(es) is used to cotransfect 293 cells expressing pol or pol 
and pTP along with the desired EAM construct. The resulting EAM preparation should 
contain little if any helper virus and any contaminating helper virus present would be 
replication defective and incapable of expressing viral late gene products. Helper viruses 
containing loxP sequences and deletions in all essential early genes may be employed in 
conjunction with Cre recombinanse-expressing cell lines expressing in tram the EL E4 ORF 
6. E2a. and E2b {e.g.. Ad polymerase and pTP) proteins (the E3 proteins are dispensible for 
growth in culture). Cell lines coexpressing Eh polymerase and pTP are provided herein. 
Cell lines expressing El and E4 proteins have been recently described [Krougliak and Graham 
(1995) Hum. Gene Ther. 6:1575 and Wang et ai (1995) Gene Ther. 2:775] and cell lines 
expressing El and E2a proteins have been recently described [Zhou et ai (1996) J. Virol. 
70:7030]. Therefore, a cell line co-expressing El, E2a, E2b. and E4 is consructed by 
introduction of expression plasmids containing the E2a and E4 coding regions into the E1-. 
Ad polymerase- and pTP-expressing cell lines of the present invention. These packaging cell 
lines are used in conjunction with helper viruses containing deletions in the El. E2a. E2b and 
E4 regions. 

All publications and patents mentioned in the above specification are herein 
incorporated by reference. Various modifications and variations of the described method and 
system of the invention will be apparent to those skilled in the art without departing from the 
scope and spirit of the invention. Although the invention has been described in connection 
with specific preferred embodiments, it should be understood that the invention as claimed 
should not be unduly limited to such specific embodiments. Indeed, various modifications of 
the described modes for carrying out the invention which are obvious to those skilled in 
molecular biology or related fields are intended to be within the scope of the following 
claims. 




WO 98/17783 PCT7US97/19541 

SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Chamberlain, Jeffrey S. 

Amalfitano, Andrea 
Hauser, Michael A. 
Kumar- Singh, Rajendra 
Hartigan-O' Connor, Dennis J. 

(ii) TITLE OF INVENTION: IMPROVED ADENOVIRUS VECTORS 

(iii) NUMBER OF SEQUENCES: 15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Medlen & Carroll, LLP 

(B) STREET: 220 Montgomery Street, Suite 2200 

(C) CITY: San Francisco 

(D) STATE: California 

(E) COUNTRY: United States Of America 

(F) ZIP: 94104 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #l.o, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Ingolia, Diane E . 

(B) REGISTRATION NUMBER: 40,027 

(C) REFERENCE/DOCKET NUMBER: UM- 02484 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 705-8410 

(B) TELEFAX: (415) 397-8338 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 3 5935 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = " DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



CATCATCAAT 


AATATACCTT 


ATTTTGGATT 


GAAGCCAATA 


TGATAATGAG 


GGGGTGGAGT 


60 


TTGTGACGTG 


GCGCGGGGCG 


TGGGAACGGG 


GCGGGTGACG 


TAGTAGTGTG 


GCGGAAGTGT 


120 


GATGTTGCAA 


GTGTGGCGGA 


ACACATGTAA 


GCGACGGATG 


TGGCAAAAGT 


GACGTTTTTG 


180 


GTGTGCGCCG 


GTGTACACAG 


GAAGTGACAA 


TTTTCGCGCG 


GTTTTAGGCG 


GATGTTGTAG 


240 


TAAATTTGGG 


CGTAACCGAG 


TAAGATTTGG 


CCATTTTCGC 


GGGAAAACTG 


AATAAGAGGA 


300 


AGTGAAATCT 


GAATAATTTT 


GTGTTACTCA 


TAGCGCGTAA 


TATTTGTCTA 


GGGCCGCGGG 


360 


GACTTTGACC 


GTTTACGTGG 


AGACTCGCCC 


AGGTGTTTTT 


CTCAGGTGTT 


TTCCGCGTTC 


420 


CGGGTCAAAG 


TTGGCGTTTT 


ATTATTATAG 


TCAGCTGACG 


TGTAGTGTAT 


TTATACCCGG 


480 
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TGAGTTCCTC AAGAGGCCAC TCTTGAGTGC CAGCGAGTAG AGTTTTCTCC TCCGAGCCGC 54 0 



TCCGACACCG GGACTGAAAA TGAGACATAT TATCTGCCAC GGAGGTGTTA TTACCGAAGA 
AATGGCCGCC AGTCTTTTGG ACCAGCTGAT CGAAGAGGTA CTGGCTGATA ATCTTCCACC 
TCCTAGCCAT TTTGAACCAC CTACCCTTCA CGAACTGTAT GATTTAGACG TGACGGCCCC 
CGAAGATCCC AACGAGGAGG CGGTTTCGCA GATTTTTCCC GACTCTGTAA TGTTGGCGGT 
GCAGGAAGGG ATTGACTTAC TCACTTTTCC GCCGGCGCCC GGTTCTCCGG AGCCGCCTCA 
CCTTTCCCGG CAGCCCGAGC AGCCGGAGCA GAGAGCCTTG GGTCCGGTTT CTATGCCAAA 
CCTTGTACCG GAGGTGATCG ATCTTACCTG CCACGAGGCT GGCTTTCCAC CCAGTGACGA 
CGAGGATGAA GAGGGTGAGG AGTTTGTGTT AGATTATGTG GAGCACCCCG GGCACGGTTG 
CAGGTCTTGT CATTATCACC GGAGGAATAC GGGGGACCCA GATATTATGT GTTCGCTTTG 
CTATATGAGG ACCTGTGGCA TGTTTGTCTA CAGTAAGTGA AAATTATGGG CAGTGGGTGA 
TAGAGTGGTG GGTTTGGTGT GGTAATTTTT TTTTTAATTT TTACAGTTTT GTGGTTTAAA 
GAATTTTGTA TTGTGATTTT TTTAAAAGGT CCTGTGTCTG AACCTGAGCC TGAGCCCGAG 
CCAGAACCGG AGCCTGCAAG ACCTACCCGC CGTCCTAAAA TGGCGCCTGC TATCCTGAGA 
CGCCCGACAT CACCTGTGTC TAGAGAATGC AATAGTAGTA CGGATAGCTG TGACTCCGGT 
CCTTCTAACA CACCTCCTGA GATACACCCG GTGGTCCCGC TGTGCCCCAT TAAACCAGTT 
GCCGTGAGAG TTGGTGGGCG TCGCCAGGCT GTGGAATGTA TCGAGGACTT GCTTAACGAG 
CCTGGGCAAC CTTTGGACTT GAGCTGTAAA CGCCCCAGGC CATAAGGTGT AAACCTGTGA 
TTGCGTGTGT GGTTAACGCC TTTGTTTGCT GAATGAGTTG ATGTAAGTTT AATAAAGGGT 
GAGATAATGT TTAACTTGCA TGGCGTGTTA AATGGGGCGG GGCTTAAAGG GTATATAATG 
CGCCGTGGGC TAATCTTGGT TACATCTGAC CTCATGGAGG CTTGGGAGTG TTTGGAAGAT 
TTTTCTGCTG TGCGTAACTT GCTGGAACAG AGCTCTAACA GTACCTCTTG GTTTTGGAGG 
TTTCTGTGGG GCTCATCCCA GGCAAAGTTA GTCTGCAGAA TTAAGGAGGA TTACAAGTGG 
GAATTTGAAG AGCTTTTGAA ATCCTGTGGT GAGCTGTTTG ATTCTTTGAA TCTGGGTCAC 
CAGGCGCTTT TCCAAGAGAA GGTCATCAAG ACTTTGGATT TTTCCACACC GGGGCGCGCT 
GCGGCTGCTG TTGCTTTTTT GAGTTTTATA AAGGATAAAT GGAGCGAAGA AACCCATCTG 
AGCGGGGGGT ACCTGCTGGA TTTTCTGGCC ATGCATCTGT GGAGAGCGGT TGTGAGACAC 
AAGAATCGCC TGCTACTGTT GTCTTCCGTC CGCCCGGCGA TAATACCGAC GGAGGAGCAG 
CAGCAGCAGC AGGAGGAAGC CAGGCGGCGG CGGCAGGAGC AGAGCCCATG GAACCCGAGA 
GCCGGCCTGG ACCCTCGGGA ATGAATGTTG TACAGGTGGC TGAACTGTAT CCAGAACTGA 
GACGCATTTT GACAATTACA GAGGATGGGC AGGGGCTAAA GGGGGTAAAG AGGGAGCGGG 
GGGCTTGTGA GGCTACAGAG GAGGCTAGGA ATCTAGCTTT TAGCTTAATG ACCAGACACC 
GTCCTGAGTG TATTACTTTT CAACAGATCA AGGATAATTG CGCTAATGAG CTTGATCTGC 
TGGCGCAGAA GTATTCCATA GAGCAGCTGA CCACTTACTG GCTGCAGCCA GGGGATGATT 
TTGAGGAGGC TATTAGGGTA TATGCAAAGG TGGCACTTAG GCCAGATTGC AAGTACAAGA 



600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
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PCI7US97/19541 


TP A fir* £ A R fT 


T> r** r VT\ A A T A Ti/"^ 

TG 1 AAATATC 


TV /"^i ^1 9k 9k MmMmm 

AGGAATTGTT 


GCTACATTTC 


TGGGAACGGG 


GCCGAGGTGG 


2640 


AfJATAf^kTA-r 1 

rvaA 1 MLjA 1 At. 


GGAGGATAGG 


GTGGCCTTTA 


GATGTAGCAT 


GATAAATATG 


TGGCCGGGGG 


2700 


1\jL.1 itjljV-Al 


GGACGGGGTG 


GTTATTATGA 


ATGTAAGGTT 


TACTGGCCCC 


AATTTTAGCG 


2760 




ppTP*P'p , /'* , a at 


ACCAACCTTA 


TCCTACACGG 


TGTAAGCTTC 


TATGGGTTTA 


2820 




1 G 1 GGAAGCC 


TGGACCGATG 


TAAGGGTTCG 


GGGCTGTGCC 


TTTTACTGCT 


2880 


cictt'C a a err* 

VjV- 1 VjLjAAGGG 


GGTGGTGTGT 


CGCCCCAAAA 


GCAGGGCTTC 


AATTAAGAAA TGCCTCTTTG 


2940 


A A RPPTrTRP 


CTTGGGTATC 


CTGTCTGAGG 


GTAACTCCAG 




TV TV r P/^T/* , t/9^/9m 

AATGTGGCCT 


3000 


wl_VjAv« 1 \j 1 VjVj 


TTGCTTCATG 


CTAGTGaAAA 


GCGTGGCTGT 


«nl lAAbLAl 


TV T\ ^9 TV r*9<9 ^9ffT9 9\ fT9 

AACATGGTAT 


3060 




CGAGGACAGG 


GCCTCTCAGA 


TGCTGACCTG 


CTcrr* a ccrT' 
V- i ULjGAv-GGL 


TV TV ^T^rtflf^/^* ^9^9 

AACTGTCACC 


3120 


1 Vj L» 1 Vj AAVj At, 


CATTCACGTA 


GCCAGCCACT 


CTCGCAAGGC 


L. 1 bbLLAb 1 G 


^n^j9^T^^^ Ti ^9j 9k #»9 9i 

TTTGAGCATA 


3180 


A f 1 AT A PTf2 A P 
**L. A 1 1 uAL 


LLGCTGTTCC 


TTGCATTTGG 


GTAACAGGAG 


GGGGGTGTTC 


CTACCTTACC 


3240 


A AT/2PA ATTT 


G AGT C AC AC T 


9k 9k 9k ifl -m mum ^_ 

AAGATATTGC 


TTGAGCCCGA 


GAGCATGTCC 


AAGGTGAACC 


3300 




ftt^iw^^^ TV ^% ti m^9 

G TTTG ACATG 


ACCATGAAGA 


TCTGGAAGGT 


^9 m ^9 ^9 f99 m fm 

GCTGAGGTAC 


GATGAGACCC 


3360 


pip & rrirpTr 

vj^>\L-L.Atjo 1 U 


CAGACCCTGC 


GAGTGTGGCG 


GTAAACATAT 


9\ ^9 9k 9k ^9 ^9 9k. >9 

TAGGAACCAG 


CCTGTGATGC 


3420 


1 obA lb I GAC 


^*9 9i ^9; y*9 9k mjh 

CGAGGAGCTG 


AGGCCCGATC 


ACTTGGTGCT 


GGCCTGCACC 


CGCGCTGAGT 


3480 


1 1GGL, 1 L 1 AG 


CGATGAAGAT 


ACAGATTGAG 


GTACTGAAAT 


GTGTGGGCGT 


GGCTTAAGGG 


3540 


1 ovjLjAAAGAA 


TATATAAGGT 


GGGGGTCTTA 


TGTAGTTTTG 


TATCTGTTTT 


GCAGCAGCCG 


3600 


L.v_Gwi_GLCAT 


GAGCACCAAC 


TCGTTTGATG 


GAAGCATTGT 


GAGCTCATAT 


TTGACAACGC 


3660 


GL.A1GCCCCC 


ATGGGCCGGG 


GTGCGTCAGA 


ATGTGATGGG 


^^•^99 ^^9 ^9 9k #wpm 

CTCCAGCATT 


GATGGTCGCC 


3720 


(-V.GIC.CTGCC 


^k ^1 V i"i iVh 

CGCAAACTCT 


ACTACCTTGA 


CCTACGAGAC 


CGTGTCTGGA 


ACGCCGTTGG 


3780 


AGACTGCAGC 


CTCCGCCGCC 


GCTTCAGCCG 


CTGCAGCCAC 


CGCCCGCGGG 


ATTGTGACTG 


3840 


At. 1 i I GLTTT 


CCTGAGCCCG 


CTTGCAAGCA 


GTGCAGCTTC 


CCGTTCATCC 


GCCCGCGATG 


3900 


ALAAb T TG AC 


GGCTCTTTTG 


GCACAATTGG 


ATTCTTTGAC 


CCGGGAACTT 


AATGTCGTTT 


3960 


L, i <-AGLAGCT 


GTTGGATCTG 


CGCCAGCAGG 


TTTCTGCCCT 


GAAGGCTTCC 


TCCCCTCCCA 


4020 


A i bLbbT x TA 


AAACATaAAT 


AAAAAACCAG 


ACTCTGTTTG 


GATTTGGATC 


AAGCAAGTGT 


4080 


rv 1 III i« rtrnr-^ rr> f»rp 


TTATTTAGGG 


GTTTTGCGCG 


CGCGGTAGGC 


CCGGGACCAG 


CGGTCTCGGT 


4140 


p*f^ r P r pp» a or r N "r» 
L-vj i 1 tsAGGG 1 


CCTGTGTATT 


TTTTCCAGGA 


CGTGGTAAAG 


GTGACTCTGG 


ATGTTCAGAT 


4200 


ALA 1 GGGL, AT 


AAGCCCGTCT 


CTGGGGTGGA 


GGTAGCACCA 


CTGCAGAGCT 


TCATGCTGCG 


4260 


VjGG 1 GG I GTT 


GTAGATGATC 


CAGTCGTAGC 


AGGAGCGCTG 


GGCGTGGTGC 


CTAAAAATGT 


4320 


u 1 1 1 u AG I AG 


CAAGCTGATT 


GCCAGGGGCA 


GGCCCTTGGT 


GTAAGTGTTT 


ACAAAGCGGT 


4380 


TAAGCTGGGA 


TGGGTGCATA 


CGTGGGGATA 


TGAGATGCAT 


CTTGGACTGT 


ATTTTTAGGT 


4440 


I GGL 1 ATGTT 


CCCAGCCATA 


TCCCTCCGGG 


GATTCATGTT 


GTGCAGAACC 


ACCAGCACAG 


4500 


TGTATCCGGT 


GCACTTGGGA 


AATTTGTCAT 


GTAGCTTAGA AGGAAATGCG 


TGGAAGAACT 


4560 


TGGAGACGCC 


CTTGTGACCT 


CCAAGATTTT 


CCATGCATTC 


GTCCATAATG 


ATGGCAATGG 


4620 


GCCCACGGGC 


GGCGGCCTGG 


GCGAAGATAT 


TTCTGGGATC 


ACTAACGTCA 


TAGTTGTGTT 


4680 
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ATCGTCATAG GCCATTTTTA CAAAGCGCGG GCGGAGGGTG CCAGACTGCG 474 0 



GTATAATGGT TCCATCCGGC CCAGGGGCGT AGTTACCCTC ACAGATTTGC ATTTCCCACG 4800 

CTTTGAGTTC AGATGGGGGG ATCATGTCTA CCTGCGGGGC GATGAAGAAA ACGGTTTCCG 4860 

GGGTAGGGGA GATCAGCTGG GAAGAAAGCA GGTTCCTGAG CAGCTGCGAC TTACCGCAGC 4 920 

CGGTGGGCCC GTAAATCACA CCTATTACCG GGTGCAACTG GTAGTTAAGA GAGCTGCAGC 4 980 

TGCCGTCATC CCTGAGCAGG GGGGCCACTT CGTTAAGCAT GTCCCTGACT CGCATGTTTT 504 0 

CCCTGACCAA ATCCGCCAGA AGGCGCTCGC CGCCCAGCGA TAGCAGTTCT TGCAAGGAAG 5100 

CAAAGTTTTT CAACGGTTTG AGACCGTCCG CCGTAGGCAT GCTTTTGAGC GTTTGACCAA 5160 

GCAGTTCCAG GCGGTCCCAC AGCTCGGTCA CCTGCTCTAC GGCATCTCGA TCCAGCATAT 5220 

CTCCTCGTTT CGCGGGTTGG GGCGGCTTTC GCTGTACGGC AGTAGTCGGT GCTCGTCCAG 5280 

ACGGGCCAGG GTCATGTCTT TCCACGGGCG CAGGGTCCTC GTCAGCGTAG TCTGGGTCAC 5340 

GGTGAAGGGG TGCGCTCCGG GCTGCGCGCT GGCCAGGGTG CGCTTGAGGC TGGTCCTGCT 5400 

GGTGCTGAAG CGCTGCCGGT CTTCGCCCTG CGCGTCGGCC AGGTAGCATT TGACCATGGT 5460 

GTCATAGTCC AGCCCCTCCG CGGCGTGGCC CTTGGCGCGC AGCTTGCCCT TGGAGGAGGC 5520 

GCCGCACGAG GGGCAGTGCA GACTTTTGAG GGCGTAGAGC TTGGGCGCGA GAAATACCGA 5580 

TTCCGGGGAG TAGGCATCCG CGCCGCAGGC CCCGCAGACG GTCTCGCATT CCACGAGCCA 5640 

GGTGAGCTCT GGCCGTTCGG GGTCAAAAAC CAGGTTTCCC CCATGCTTTT TGATGCGTTT 5700 

CTTACCTCTG GTTTCCATGA GCCGGTGTCC ACGCTCGGTG ACGAAAAGGC TGTCCGTGTC 5760 

CCCGTATACA GACTTGAGAG GCCTGTCCTC GAGCGGTGTT CCGCGGTCCT CCTCGTATAG 5820 

AAACTCGGAC CACTCTGAGA CAAAGGCTCG CGTCCAGGCC AGCACGAAGG AGGCTAAGTG 5880 

GGAGGGGTAG CGGTCGTTGT CCACTAGGGG GTCCACTCGC TCCAGGGTGT GAAGACACAT 5940 

GTCGCCCTCT TCGGCATCAA GGAAGGTGAT TGGTTTGTAG GTGTAGGCCA CGTGACCGGG 6000 

TGTTCCTGAA GGGGGGCTAT AAAAGGGGGT GGGGGCGCGT TCGTCCTCAC TCTCTTCCGC 6060 

ATCGCTGTCT GCGAGGGCCA GCTGTTGGGG TGAGTACTCC CTCTGAAAAG CGGGCATGAC 6120 

TTCTGCGCTA AGATTGTCAG TTTCCAAAAA CGAGGAGGAT TTGATATTCA CCTGGCCCGC 6180 

GGTGATGCCT TTGAGGGTGG CCGCATCCAT CTGGTCAGAA AAGACAATCT TTTTGTTGTC 6240 

AAGCTTGGTG GCAAACGACC CGTAGAGGGC GTTGGACAGC AACTTGGCGA TGGAGCGCAG 6300 

GGTTTGGTTT TTGTCGCGAT CGGCGCGCTC CTTGGCCGCG ATGTTTAGCT GCACGTATTC 6360 

GCGCGCAACG CACCGCCATT CGGGAAAGAC GGTGGTGCGC TCGTCGGGCA CCAGGTGCAC 6420 

GCGCCAACCG CGGTTGTGCA GGGTGACAAG GTCAACGCTG GTGGCTACCT CTCCGCGTAG 6480 

GCGCTCGTTG GTCCAGCAGA GGCGGCCGCC CTTGCGCGAG CAGAATGGCG GTAGGGGGTC 6540 

TAGCTGCGTC TCGTCCGGGG GGTCTGCGTC CACGGTAAAG ACCCCGGGCA GCAGGCGCGC 6600 

GTCGAAGTAG TCTATCTTGC ATCCTTGCAA GTCTAGCGCC TGCTGCCATG CGCGGGCGGC 6660 

AAGCGCGCGC TCGTATGGGT TGAGTGGGGG ACCCCATGGC ATGGGGTGGG TGAGCGCGGA 6720 

GGCGTACATG CCGCAAATGT CGTAAACGTA GAGGGGCTCT CTGAGTATTC CAAGATATGT 6780 
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AGGGTAGCAT 


CTTCCACCGC 


GGATGCTGGC 


AGCGAGGAGG 


TCGGGACCGA 


GGTTGCTACG 


CCTGAAGATG 


GCATGTGAGT 


TGGATGATAT 


GTCTGTGAGA 


CCTACCGCGT 


CACGCACGAA 


CAGCTCGGCG 


GTGACCTGCA 


CGTCTAGGGC 


ATACTTATCC 


TGTCCCTTTT 


TTTTCCACAG 


TTTCCAGTAC 


TCTTGGATCG 


GAAACCCGTC 


GAACTGGTTG 


ACGGCCTGGT 


AGGCGCAGCA 


CGCGGCCTTC 


CGGAGCGAGG 


TGTGGGTGAG 


GTACTGGTAT 


TTGAAGTCAG 


TGTCGTCGCA 


GCGCTTTTTG 


GAACGCGGAT 


TTGGCAGGGC 


CGCGCGAGGC 


ATAAAGTTGC 


GTGTGATGCG 


AATTACCTGG 


GCGGCGAGCA 


CGATCTCGTC 


AAGTTCCAAG 


AAGCGCGGGA 


TGCCCTTGAT 


GAGCTCTTCA 


GGGGAGCTGA 


GCCCGTGCTC 


GGAAGCGACG 


AATGAGCTCC 


ACAGGTCACG 


GGTCCTAAAC 


TGGCGACCTA 


TGGCCATTTT 


GTCTTGTTCC 


CAGCGGTCCC 


ATCCAAGGTT 


AGGCTCATCT 


CCGCCGAACT 


TCATGACCAG 


CCCCATCCAA 


GTATAGGTCT 


CTACATCGTA 


CGAGCCGATC 


GGGAAGAACT 


GGATCTCCCG 


GTGAAAGTAG 


AAGTCCCTGC 


GACGGGCCGA 


GCAGTACTGG 


CAGCGGTGCA 


CGGGCTGTAC 


CACAAGGAAG 


CAGAGTGGGA 


ATTTGAGCCC 


TACTTCGGCT 


GCTTGTCCTT 


GACCGTCTGG 


CACCACGCCG 


CGCGAGCCCA 


AAGTCCAGAT 


AACATCGCGC 


AGATGGGAGC 


TGTCCATGGT 


GAGCTCCTGC 


AGGTTTACCT 


CGCATAGACG 


CCTAATTTCC 


AGGGGCTGGT 


TGGTGGCGGC 


CGGCGCGACT 


ACGGTACCGC 


GCGGCGGGCG 


ATCTAAAAGC 


GGTGACGCGG 


GCGAGCCCCC 


AGAGGGGGCA 


GGGGCACGTC 


GGCGCCGCGC 


TTGCTGGCGA 


ACGCGACGAC 


GCGGCGGTTG 


ACGACGGGCC 


CGGTGAGCTT 


GAGCCTGAAA 


TTGACGGCGG 


CCTGGCGCAA 


AATCTCCTGC 
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GCGCACGTAA 


TCGTATAGTT 




CQAD 
OO-JU 


GGCGGGCTGC 


TCTGCTCGGA 


AG ACT AT PTC3 


q fi n 


GGTTGGACGC 


TGGAAGACGT 


TG AAG P Tfifi P 




GGAGGCGTAG 


GAGTCGCGCA 


GPTTGTTfS 21 P 




GCAGTAGTCC 


AGGGTTTCCT 


TP. 21 TCI 21 TCZTT* 


7080 


CTCGCGGTTG 


AGGACAAACT 




/140 


GGCCTCCGAA 


CGGTAAGAGC 


CTAf5f21Tf2T21 


/ Z UU 


TCCCTTTTCT 


ACGGGTAGCG 


PGT ATP, P PTP. 


/ Z O U 


CGCAAAGGTG 


TCCCTGACCA 


TG AP TTTH 21 CI 

V/ftv 111 \jn\j 


/ J z u 


TCCGCCCTGC 


TCCCAGAGCA 


AAAAGTPPfTT 




GAAGGTGACA 


TCGTTGAAGA 


GTATPTTTPP 


t a a n 


GAAGGGTCCC 


GGCACCTCGG 


AAPGGTTfSTT 
nn\.\3u Hull 




AAAGCCGTTG 


ATGTTGTGGC 


PP AP21 21 Tf2T a 


/ 3b 0 


GGAAGGCAAT 


TTTTTAAGTT 


PPTPf3T2inf2T 


/ O Z U 


TGAAAGGGCC 


CAGTCTGCAA 


\3f\ 1 OAUUU 1 i 


t c o r\ 
/ 680 


GGCCATTAGC 


ATTTGCAGGT 


cxcwcnnr* a a a 


7740 


TTCTGGGGTG 


ATGCAGTAGA 


AVju l Mb CGG 


"I O PX P\ 

7800 


CGCGGCTAGG 


TCTCGCGCGG 


LA« I LAC TAG 


7860 


CATGAAGGGC 


ACGAGCTGCT 


TCCCAAAGGC 


7920 


GGTGACAAAG 


AGACGCTCGG 




7980 


CCACCAATTG 


GAGGAGTGGC 


1 A i 1 Vj A 1 u 1 (j 


8040 


ACACTCGTGC 


TGGCTTTTGT 


a a 7\ a a r % r* w rr* c 


8100 


ATCCTGCACG 


AGGTTGACCT 






CTCGCCTGGC 


GGGTTTGGCT 


VjVjt 1 \jo 1 l 1 1 l 


o o o n 
oz z U 


CTGCTCGAGG 


GGAGTTACGG 


1 ukj/i 1 LuuAL 


8280 


GTCCGCGCGC 


GGCGGTCGGA 


vj\_ 1 luAlunL 


□ J40 


CTGGAGCTCC 


CGCGGCGTCA 


f5P,TP zi nn can 


<• 

OH Art 


GGTCAGGGCG 


CGGGCTAGAT 


P P A(V3Tf3 21 T 21 




GTCGATGGCT 


TGCAAGAGGC 


PGPZiTPPPPfs 


O DZ U 


GTGGGCCGCG 


GGGGTGTCCT 


TP.fi 21 Tf2 21 TY2 P 


aeon 
a b80 


GGAGGTAGGG 


GGGGCTCCGG 




8640 


GCGGGCAGGA 


GCTGGTGCTG 


CGCGCGTAGG 


8700 


ATCTCCTGAA 


TCTGGCGCCT 


CTGCGTGAAG 


8760 


GAGAGTTCGA 


CAGAATCAAT 


TTCGGTGTCG 


8820 


ACGTCTCCTG 


AGTTGTCTTG 


ATAGGCGATC 


8880 
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TCGGCCATGA ACTGCTCGAT CTCTTCCTCC TGGAGATCTC CGCGTCCGGC TCGCTCCACG 8940 
GTGGCGGCGA GGTCGTTGGA AATGCGGGCC ATGAGCTGCG AGAAGGCGTT GAGGCCTCCC 9000 



9060 
9120 
9180 
9240 
9300 



TCGTTCCAGA CGCGGCTGTA GACCACGCCC CCTTCGGCAT CGCGGGCGCG CATGACCACC 
TGCGCGAGAT TGAGCTCCAC GTGCCGGGCG AAGACGGCGT AGTTTCGCAG GCGCTGAAAG 
AGGTAGTTGA GGGTGGTGGC GGTGTGTTCT GCCACGAAGA AGTACATAAC CCAGCGTCGC 
AACGTGGATT CGTTGATATC CCCCAAGGCC TCAAGGCGCT CCATGGCCTC GTAGAAGTCC 
ACGGCGAAGT TGAAAAACTG GGAGTTGCGC GCCGACACGG TTAACTCCTC CTCCAGAAGA 
CGGATGAGCT CGGCGACAGT GTCGCGCACC TCGCGCTCAA AGGCTACAGG GGCCTCTTCT 9360 
TCTTCTTCAA TCTCCTCTTC CATAAGGGCC TCCCCTTCTT CTTCTTCTGG CGGCGGTGGG 
GGAGGGGGGA CACGGCGGCG ACGACGGCGC ACCGGGAGGC GGTCGACAAA GCGCTCGATC 
ATCTCCCCGC GGCGACGGCG CATGGTCTCG GTGACGGCGC GGCCGTTCTC GCGGGGGCGC 
AGTTGGAAGA CGCCGCCCGT CATGTCCCGG TTATGGGTTG GCGGGGGGCT GCCATGCGGC 
AGGGATACGG CGCTAACGAT GCATCTCAAC AATTGTTGTG TAGGTACTCC GCCGCCGAGG 
GACCTGAGCG AGTCCGCATC GACCGGATCG GAAAACCTCT CGAGAAAGGC GTCTAACCAG 
TCACAGTCGC AAGGTAGGCT GAGCACCGTG GCGGGCGGCA GCGGGCGGCG GTCGGGGTTG 
TTTCTGGCGG AGGTGCTGCT GATGATGTAA TTAAAGTAGG CGGTCTTGAG ACGGCGGATG 
GTCGACAGAA GCACCATGTC CTTGGGTCCG GCCTGCTGAA TGCGCAGGCG GTCGGCCATG 
CCCCAGGCTT CGTTTTGACA TCGGCGCAGG TCTTTGTAGT AGTCTTGCAT GAGCCTTTCT 
ACCGGCACTT CTTCTTCTCC TTCCTCTTGT CCTGCATCTC TTGCATCTAT CGCTGCGGCG 
GCGGCGGAGT TTGGCCGTAG GTGGCGCCCT CTTCCTCCCA TGCGTGTGAC CCCGAAGCCC 
CTCATCGGCT GAAGCAGGGC TAGGTCGGCG ACAACGCGCT CGGCTAATAT GGCCTGCTGC 
ACCTGCGTGA GGGTAGACTG GAAGTCATCC ATGTCCACAA AGCGGTGGTA TGCGCCCGTG 
TTGATGGTGT AAGTGCAGTT GGCCATAACG GACCAGTTAA CGGTCTGGTG ACCCGGCTGC 
GAGAGCTCGG TGTACCTGAG ACGCGAGTAA GCCCTCGAGT CAAATACGTA GTCGTTGCAA 
GTCCGCACCA GGTACTGGTA TCCCACCAAA AAGTGCGGCG GCGGCTGGCG GTAGAGGGGC 
CAGCGTAGGG TGGCCGGGGC TCCGGGGGCG AGATCTTCCA ACATAAGGCG ATGATATCCG 
TAGATGTACC TGGACATCCA GGTGATGCCG GCGGCGGTGG TGGAGGCGCG CGGAAAGTCG 
CGGACGCGGT TCCAGATGTT GCGCAGCGGC AAAAAGTGCT CCATGGTCGG GACGCTCTGG 
CCGGTCAGGC GCGCGCAATC GTTGACGCTC TAGACCGTGC AAAAGGAGAG CCTGTAAGCG 
GGCACTCTTC CGTGGTCTGG TGGATAAATT CGCAAGGGTA TCATGGCGGA CGACCGGGGT 
TCGAGCCCCG TATCCGGCCG TCCGCCGTGA TCCATGCGGT TACCGCCCGC GTGTCGAACC 
CAGGTGTGCG ACGTCAGACA ACGGGGGAGT GCTCCTTTTG GCTTCCTTCC AGGCGCGGCG 
GCTGCTGCGC TAGCTTTTTT GGCCACTGGC CGCGCGCAGC GTAAGCGGTT AGGCTGGAAA 
GCGAAAGCAT TAAGTGGCTC GCTCCCTGTA GCCGGAGGGT TATTTTCCAA GGGTTGAGTC 
GCGGGACCCC CGGTTCGAGT CTCGGACCGG CCGGACTGCG GCGAACGGGG GTTTGCCTCC 



9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
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CCGTCATGCA 


AGACCCCGCT 


TGCAAATTCC 


TTTTCCCAGA 


TGCATCCGGT 


GCTGCGGCAG 


CAAGAGCAGC 


GGCAGACATG 


CAGGGCACCC 


ACATCCGCGG 


TTGACGCGGC 


AGCAGATGGT 


CACTACCTGG 


ACTTGGAGGA 


GGGCGAGGGP 


CGGTACCCAA 


GGGTGCAGCT 


GAAGCGTGAT 


CTGTTTCGCG 


ACCGCGAGGG 


AGAGGAGCCP 


GGGCGCGAGC 


TGCGGCATGG 


CCTGAATCGC 


CCCGACGCGC 


GAACCGGGAT 


TAGTCCCGCG 


ACCGCATACG 


AGCAGACGGT 


GAACCAGGAG 


GTGCGTACGC 


TTGTGGCGCG 


CGAGGAGGTG 


GTAAGCGCGC 


TGGAGCAAAA 


CCCAAATAGC 


GTGCAGCACA 


GCAGGGACAA 


CGAGGCATTC 


GAGGGCCGCT 


GGCTGCTCGA 


TTTGATAAAC 


AGCTTGAGCC 


TGGCTGACAA 


GGTGGCCGCC 


TTTTACGCCC 


GCAAGATATA 


CCATACCCCT 


GAGGGGTTCT 


ACATGCGCAT 




TATCGCAACG 


AGCGCATCCA 


C AAGGP Cd TH 


CGCGAGCTGA 


TGCACAGCCT 




GCCGAGTCCT 


ACTTTGACGC 




GAGGCAGCTG 


GGGCCGGACC 


TGGGPTrmpr; 


GGCGTGGAGG 


AATATGACGA 


GGAPf^ATf^an 


GTGATGTTTC 


TGATCAGATG 


ATGPAAfiaPrt 


AGAGCCAGCC 


GTCCGGCCTT 


AAC T C PA Pnn 


TGTCGCTGAC 


TGCGCGCAAT 


CPTnAPnp^T 


CCGCAATTCT 


GGAAGCGGTG 


GTCCCGGCGP 


CGATCGTAAA 


CGCGCTGGCC 


GAAAACAGGG 


ACGACGCGCT 


GCTTCAGCGC 


GTGGCTCGTT 


ACCGGCTGGT 


GGGGGATGTG 


CGCGAGGPPG 


GCAACCTGGG 


CTCCATGGTT 


GCAPTAA APG 


CGCGGGGACA 


GGAGGACTAC 


APP A A PTTTG 


CACCGCAAAG 


TGAGGTGTAC 


CAGTCTGGGC 


GCCTGCAGAC 


CGTAAACCTG 


AGCCAGGCTT 


GGGCTCCCAC 


AGGCGACCGC 


GCGACCGTGT 


TGCTGCTGCT 


AATAGCGCCC 


TTCACGGACA 
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TCCGGAAACA 


GGGAPGAGPP 




11040 


ATGCGCCCCC 


CTPPTPAGPA 


CZC Cine A 7\p7\r 
Vj Vjvjt u AAG AG 


11100 


TPPPPTPPTP 


CTZLCTTir'ClTr 
v- X rtLLuLu X L. 


AvjGAGGGGCG 


11160 


GATTAPGAAP 




L.V-GGGCCCGG 


11220 






C TCTCCTGAG 


11280 


APGPGTGAGG 


P f2 T li. P fTY"* P 
v- 1 Av». vj X vjV- L 


GCGGCAGAAC 


11340 


GAGGAGATGP 


Grzf2 a Trr a a a 


Gil CCACGCA 


11400 


GAGCGGTTGP 


TGPGPPa/^na 


r*P TV Till 

GGAv-TTTGAG 


11460 


CGCGCACACG 


TGGPGGPPPP 


t.uAv,v, xGGTA 


11520 


ATTAACTTTP 


AAA A AaGPTT 


1 AAv» AAC C AC 


11580 


G CT AT AGGAP 


TfiaTnr&TrT 

X ortiuLAl L. 1 


G 1 GGGACTTT 


11640 


AAGCCGCTPA 


TGGPGPa<^PT 


Gl 1CCTTATA 


11700 


AGGGATGPGP 




TV PTJi /*» TV /*i riPP 

AG x AGAGCCC 


11760 


ATCCTGCAGA 


uLrt X H\3 X Ljvj X 


GLAGGAGCGC 


11820 


ATPAAPTATT 




CCTGGGCAAG 


11880 


TAPGTTPPPA 




GGTAAAGATC 


11940 


riTflPTTA HPT 


1 CaAGCGACGA 


CCTGGGCGTT 


12000 




CjGCGGCGCGA 


GCTCAGCGAC 


12060 


V- X UOL X uuLn 


v-vjOGCAGCGG 


CGATAGAGAG 


12120 


V- X \JL-\J\- X uou 


v-CCCAAGCCG 


ACGCGCCCTG 


12180 




UGCGCGCTGG 


CAACGTCGGC 


12240 




AGGACGGCGA 


GTACTAAGCG 


12300 


p iv a pno a p r 1 


GGCGGTGCGG 


GCGGCGCTGC 


12360 


APGAPTGGPfS 


v. V. AGG X (- A X G 


P* TV t~*(**r** TV (T^/"t IV 

GACCGCATCA 


12420 


tppggp a nr a 


G\»\,GL.AGGCC 


AAC cggctct 


12480 


GPGPAAAPPP 


LAt Gv- AL- G AG 


TV TtPrnn/^^mnn 

AAGGTGCTGG 


12540 


PPATPPfZGPP 


v- GAv-GAGG v. v_ 


GGCCTGGTCT 


12600 


APA AP AGPGG 


X Gv_AG 


TV TV TV /*im^^^^ 

ACCAACCTGG 


12660 


TGGPGPaGPf2 


x GAGGGCGCG 


CAGCAGCAGG 


12720 


PPTTPPTfuar' 

n-V- X X v«.v_ X bnu 


1 A v> AUAG LLL 


r^T ^( TV TV ^l^m^i 

GCCAACGTGC 


12780 


TGAGPGPaPT 


GV.GGU J. AA 1 G 


/ »»T'P* TV nm^ tv tv 

G T G AC TG AGA 


12840 


CAGACTATTT 


TTTCCAGACC 


AGTAGACAAG 


12900 


TCAAAAACTT 


GCAGGGGCTG 


TGGGGGGTGC 


12960 


CTAGCTTGCT 


GACGCCCAAC 


TCGCGCCTGT 


13020 


GTGGCAGCGT 


GTCCCGGGAC 


ACATACCTAG 


13080 
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GTCACTT3CT GACACTGTAC CGCGAGGCCA TAGGTCAGGC GCATGTGGAC GAGCATACTT 13X40 
TCCAGGAGAT TACAAGTGTC AGCCGCGCGC TGGGGCAGGA GGACACGGGC AGCCTGGAGG 
CAACCCTAAA CTACCTGCTG ACCAACCGGC GGCAGAAGAT CCCCTCGTTG CACAGTTTAA 
ACAGCGAGGA GGAGCGCATT TTGCGCTACG TGCAGCAGAG CGTGAGCCTT AACCTGATGC 
GCGACGGGGT AACGCCCAGC GTGGCGCTGG ACATGACCGC GCGCAACATG GAACCGGGCA 
TGTATGCCTC AAACCGGCCG TTTATCAACC GCCTAATGGA CTACTTGCAT CGCGCGGCCG 
CCGTGAACCC CGAGTATTTC ACCAATGCCA TCTTGAACCC GCACTGGCTA CCGCCCCCTG 
GTTTCTACAC CGGGGGATTC GAGGTGCCCG AGGGTAACGA TGGATTCCTC TGGGACGACA 
TAGACGACAG CGTGTTTTCC CCGCAACCGC AGACCCTGCT AGAGTTGCAA CAGCGCGAGC 
AGGCAGAGGC GGCGCTGCGA AAGGAAAGCT TCCGCAGGCC AAGCAGCTTG TCCGATCTAG 
GCGCTGCGGC CCCGCGGTCA GATGCTAGTA GCCCATTTCC AAGCTTGATA GGGTCTCTTA 
CCAGCACTCG CACCACCCGC CCGCGCCTGC TGGGCGAGGA GGAGTACCTA AACAACTCGC 
TGCTGCAGCC GCAGCGCGAA AAAAACCTGC CTCCGGCATT TCCCAACAAC GGGATAGAGA 
GCCTAGTGGA CAAGATGAGT AGATGGAAGA CGTACGCGCA GGAGCACAGG GACGTGCCAG 
GCCCGCGCCC GCCCACCCGT CGTCAAAGGC ACGACCGTCA GCGGGGTCTG GTGTGGGAGG 
ACGATGACTC GGCAGACGAC AGCAGCGTCC TGGATTTGGG AGGGAGTGGC AACCCGTTTG 
CGCACCTTCG CCCCAGGCTG GGGAGAATGT TTTAAAAAAA AAAAAGCATG ATGCAAAATA 
AAAAACTCAC CAAGGCCATG GCACCGAGCG TTGGTTTTCT TGTATTCCCC TTAGTATGCG 
GCGCGCGGCG ATGTATGAGG AAGGTCCTCC TCCCTCCTAC GAGAGTGTGG TGAGCGCGGC 
GCCAGTGGCG GCGGCGCTGG GTTCTCCCTT CGATGCTCCC CTGGACCCGC CGTTTGTGCC 
TCCGCGGTAC CTGCGGCCTA CCGGGGGGAG AAACAGCATC CGTTACTCTG AGTTGGCACC 
CCTATTCGAC ACCACCCGTG TGTACCTGGT GGACAACAAG TCAACGGATG TGGCATCCCT 
GAACTACCAG AACGACCACA GCAACTTTCT GACCACGGTC ATTCAAAACA ATGACTACAG 
CCGGGGGGAG GCAAGCACAC AGACCATCAA TCTTGACGAC CGGTCGCACT GGGGCGGCGA 
CCTGAAAACC ATCCTGCATA CCAACATGCC AAATGTGAAC GAGTTCATGT TTACCAATAA 
GTTTAAGGCG CGGGTGATGG TGTCGCGCTT GCCTACTAAG GACAATCAGG TGGAGCTGAA 
ATACGAGTGG GTGGAGTTCA CGCTGCCCGA GGGCAACTAC TCCGAGACCA TGACCATAGA 
CCTTATGAAC AACGCGATCG TGGAGCACTA CTTGAAAGTG GGCAGACAGA ACGGGGTTCT 
GGAAAGCGAC ATCGGGGTAA AGTTTGACAC CCGCAACTTC AGACTGGGGT TTGACCCCGT 
CACTGGTCTT GTCATGCCTG GGGTATATAC AAACGAAGCC TTCCATCCAG ACATCATTTT 
GCTGCCAGGA TGCGGGGTGG ACTTCACCCA CAGCCGCCTG AGCAACTTGT TGGGCATCCG 
CAAGCGGCAA CCCTTCCAGG AGGGCTTTAG GATCACCTAC GATGATCTGG AGGGTGGTAA 
CATTCCCGCA CTGTTGGATG TGGACGCCTA CCAGGCGAGC TTGAAAGATG ACACCGAACA 
GGGCGGGGGT GGCGCAGGCG GCAGCAACAG CAGTGGCAGC GGCGCGGAAG AGAACTCCAA 
CGCGGCAGCC GCGGCAATGC AGCCGGTGGA GGACATGAAC GATCATGCCA TTCGCGGCGA 
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CACCTTTGCC 


ACACGGGCTG 


AGGAGAAGCG 


CGCTGAGGCC 


GAAGCAGPGG 
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CCGAAGCTGC 15240 


CGCCCCCSCT 


GCGCAACCCG 


AGGTCGAGAA 


GCCTCAGAAG 


AAAPPGGTnA 

rw\\* www X \JJ-\ 


TCAAACCCCT 


15300 


GACAGAGGAC 


AGCAAGAAAC 


GCAGTTACAA 


CCTAATAAGC 


AATGAPAnPA 


CCTTCACCCA 


15360 


GTACCGCAGC 


TGGTACCTTG 


CATACAACTA 


CGGCGACCCT 


PAfJAPPf^riAA 


TCCGCTCATG 


15420 


GACCCTGCTT 


TGCACTCCTG 


ACGTAACCTG 


CGGCTCGGAG 


CAGGTPTAPT 

wanju x w x x 


GGTCGTTGCC 


15480 


AGACATGATG 


CAAGACCCCG 


TGACCTTCCG 


CTCCACGCGC 


CAGATPAfJPA 

VOVft X wAw*wA 


ACTTTCCGGT 


15540 


GGTGGGCGCC 


GAGCTGTTGC 


CCGTGCACTC 

^mr ^ ^mwm mm ^m> 


CAAGAGCTTC 


TAPAAPGAPP 

X A w An w WAN- w 


AGGCCGTCTA 


15600 


CTCCCAACTC 


ATCCGCCAGT 


TTACCTCTCT 


GACCCACGTG 


TTPAATPC5PT 

X X wAA X w V7 w X 


TTCCCGAGAA 


15660 


CCAGATTTTG 


GCGCGCCCGC 


CAGCCCCCAC 


CATCACCACC 


GTC APTfS A A A 

w X UnVJ 1 Vj aaa 


ACGTTCCTGC 


15720 


TCTCACAGAT 


CACGGGACGC 


TACCGCTGCG 


CAACAGCATC 


GRAnfSAfJTPP 


AGCGAGTGAC 


15780 


CATTACTGAC 


GCCAGACGCC 


GCACCTGCCC 


CTACGTTTAC 


AAGGPPPTftfl 

AAwww w w X OvJ 


GCATAGTCTC 


15840 


GCCGCGCGTC 


CTATCGAGCC 


GCACTTTTTG 


AGCAAGCATG 


TCP ATP PTT A 

•LwwAXwwX Xa 


TATCGCCCAG 


15900 


CAATAACACA 


GGCTGGGGCC 


TGCGCTTCCC 

^™ ^» ^ mm ^m ^m 


AAGCAAGATG 


X X X UUvvubVJ 


CCAAGAAGCG 


15960 


CTCCGACCAA 


CACCCAGTGC 


GCGTGCGCGG 


GCACTAC CGC 


PPPPPTYSfirc 

VJwOwww IVjVjO 


GCGCGCACAA 


16020 


ACGCGGCCGC 


ACTGGGCGCA 


CCACCGTCGA 




w'AwUww'w' 1 \J\J 


TGGAGGAGGC 


16080 


GCGCAACTAC 


ACGCCCACGC 


CGCCACCAGT 




czTLCczcnnrT'Ti 

UrtLuLuuLLn 


TTCAGACCGT 


16140 


GGTGCGCGGA 


GCCCGGCGCT 


ATGCTAAAAT 






TAGCACGTCG 


16200 


CCACCGCCGC 


CGACCCGGCA 


CTGCCGCCCA 


ACG CGCGG 




TTAACCGCGC 


16260 


ACGTCGCACC 


GGCCGACGGG 


CGGCCATGCG 

^b» ^b* ^b* ^mm m am \amW ^mmw ^aj 




nbuL I OUw wU 


CGGGTATTGT 


16320 


CACTGTGCCC 


CCCAGGTCCA 


GGCGACGAGC 

^b» ■m*w>*m0+ m W* *ajm\ m*mj 


GGCCGCCGPA 




CCATTAGTGC 


16380 


TATGACTCAG 


GGTCGCAGGG 


GCAACGTGTA 

*m* *—rm mm m 1^ mm \mJ mm 4T\ 


TTGGGTGPfIC 


WAw 1 Luu I1a 


GCGGCCTGCG 


16440 


CGTGCCCGTG 


CGCACCCGCC 


CCCCGCGCAA 


CTAGATTGCA 


Af?A A A A A APT" 


ACTTAGACTC 


16500 


GTACTGTTGT 


ATGTATCCAG 


CGGCGGCGGC 

^bT mmm ^m* *m0 *mmW ^aw \J Vj X#» 


GCG CAAPdAA 


ww InlUl wA 


AGCGCAAAAT 


16560 


CAAAGAAGAG 


ATGCTCCAGG 


TCATCGCGCC 


GGAGATPTAT 




AGAAGGAAGA 


16620 


GCAGGATTAC 


AAGCCCCGAA 


AGCTAAAGCG 


GGTPAAAAAfi 


aaaaaOaaaLj 


ATGATGATGA 


16680 


TGAACTTGAC 


GACGAGGTGG 


AACTGCTGCA 


WWW X Aw WV3 w W 


p p p a f^nr^r* a r* 

w w wAUw*wOaw 


GGGTACAGTG 


16740 


GAAAGGTCGA 


CGCGTAAAAC 


GTGTTTTGCG 

^ ' mm mm. mm mm am. 


ACC CGGC AP C 

Aw w w\J Wi\_ w 


a ppnT a (7tpt 

AwwOl fiUi V_ 1 


TTACGCCCGG 


16800 


TGAGCGCTCC 


ACCCGCACCT 


ACAAGCGCGT 


GTATGATGAG 




ACGAGGACCT 


16860 


GCTTGAGCAG 


GCCAACGAGC 


GCCTCGGGGA 


wX X XVJwwXAw 


VJVJAAaLI w O w* w 


ATAAGGACAT 


16920 


GCTGGCGTTG 


CCGCTGGACG 


AGG G C AAC PC 


AAPAPPTAflP 


w X AA/iVJwwww- 


TAACACTGCA 


16980 


GCAGGTGCTG 


CCCGCGCTTG 


CACCGTCCGA 


AG A A A AfiPfSP 


wLL 1 aaaOw 


GCGAGTCTGG 


17040 


TGACTTGGCA 


CCCACCGTGC 


AGCTGATGGT 


ACCPAAGPGP 

aw w w nn\j w w w 


PAf3Pr;APTcr5 


AAGATGTCTT 


17100 


HfJAA A A A ATH 




CTGGGCTGGA 


gmm mm *mm mm\ mm mmm ***m mmm ^mm 

GCCCGAGGTC 


CGCGTGCGGC 


CAATCAAGCA 


17160 


GGTGGCGCCG 


GGACTGGGCG 


TGCAGACCGT 


GGACGTTCAG 


ATACCCACTA 


CCAGTAGCAC 


17220 


CAGTATTGCC 


ACCGCCACAG 


AGGGCATGGA 


GACACAAACG 


TCCCCGGTTG 


CCTCAGCGGT 


17280 
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GGCGGATGCC GCGGTGCAGG CGGTCGCTGC GGCCGCGTCC AAGACCTCTA CGGAGGTGCA 17340 

AACGGACCCG TGGATGTTTC GCGTTTCAGC CCCCCGGCGC CCGCGCGGTT CGAGGAAGTA -17400 

CGGCGCCGCC AGCGCGCTAC TGCCCGAATA TGCCCTACAT CCTTCCATTG CGCCTACCCC 17460 

CGGCTATCGT GGCTACACCT ACCGCCCCAG AAGACGAGCA ACTACCCGAC GCCGAACCAC 17520 

CACTGGAACC CGCCGCCGCC GTCGCCGTCG CCAGCCCGTG CTGGCCCCGA TTTCCGTGCG 17580 

CAGGGTGGCT CGCGAAGGAG GCAGGACCCT GGTGCTGCCA ACAGCGCGCT ACCACCCCAG 17640 

CATCGTTTAA AAGCCGGTCT TTGTGGTTCT TGCAGATATG GCCCTCACCT GCCGCCTCCG 17700 

TTTCCCGGTG CCGGGATTCC GAGGAAGAAT GCACCGTAGG AGGGGCATGG CCGGCCACGG 17760 

CCTGACGGGC GGCATGCGTC GTGCGCACCA CCGGCGGCGG CGCGCGTCGC ACCGTCGCAT 17820 

GCGCGGCGGT ATCCTGCCCC TCCTTATTCC ACTGATCGCC GCGGCGATTG GCGCCGTGCC 17880 

CGGAATTGCA TCCGTGGCCT TGCAGGCGCA GAGACACTGA TTAAAAACAA GTTGCATGTG 17940 

GAAAAATCAA AATAAAAAGT CTGGACTCTC ACGCTCGCTT GGTCCTGTAA CTATTTTGTA 18000 

GAATGGAAGA CATCAACTTT GCGTCTCTGG CCCCGCGACA CGGCTCGCGC CCGTTCATGG 18060 

GAAACTGGCA AGATATCGGC ACCAGCAATA TGAGCGGTGG CGCCTTCAGC TGGGGCTCGC 18120 

TGTGGAGCGG CATTAAAAAT TTCGGTTCCA CCGTTAAGAA CTATGGCAGC AAGGCCTGGA 18180 

ACAGCAGCAC AGGCCAGATG CTGAGGGATA AGTTGAAAGA GCAAAATTTC CAACAAAAGG 18240 

TGGTAGATGG CCTGGCCTCT GGCATTAGCG GGGTGGTGGA CCTGGCCAAC CAGGCAGTGC 18300 

AAAATAAGAT TAACAGTAAG CTTGATCCCC GCCCTCCCGT AGAGGAGCCT CCACCGGCCG 18 360 

TGGAGACAGT GTCTCCAGAG GGGCGTGGCG AAAAGCGTCC GCGCCCCGAC AGGGAAGAAA 18420 

CTCTGGTGAC GCAAATAGAC GAGCCTCCCT CGTACGAGGA GGCACTAAAG CAAGGCCTGC 18480 

CCACCACCCG TCCCATCGCG CCCATGGCTA CCGGAGTGCT GGGCCAGCAC ACACCCGTAA 18540 

CGCTGGACCT GCCTCCCCCC GCCGACACCC AGCAGAAACC TGTGCTGCCA GGCCCGACCG 18600 

CCGTTGTTGT AACCCGTCCT AGCCGCGCGT CCCTGCGCCG CGCCGCCAGC GGTCCGCGAT 18660 

CGTTGCGGCC CGTAGCCAGT GGCAACTGGC AAAGCACACT GAACAGCATC GTGGGTCTGG 18 720 

GGGTGCAATC CCTGAAGCGC CGACGATGCT TCTGAATAGC TAACGTGTCG TATGTGTGTC 1878 0 

ATGTATGCGT CCATGTCGCC GCCAGAGGAG CTGCTGAGCC GCCGCGCGCC CGCTTTCCAA 18840 

GATGGCTACC CCTTCGATGA TGCCGCAGTG GTCTTACATG CACATCTCGG GCCAGGACGC 18900 

CTCGGAGTAC CTGAGCCCCG GGCTGGTGCA GTTTGCCCGC GCCACCGAGA CGTACTTCAG 18960 

CCTGAATAAC AAGTTTAGAA ACCCCACGGT GGCGCCTACG CACGACGTGA CCACAGACCG 19020 

GTCCCAGCGT TTGACGCTGC GGTTCATCCC TGTGGACCGT GAGGATACTG CGTACTCGTA 19080 

CAAGGCGCGG TTCACCCTAG CTGTGGGTGA TAACCGTGTG CTGGACATGG CTTCCACGTA 19140 

CTTTGACATC CGCGGCGTGC TGGACAGGGG CCCTACTTTT AAGCCCTACT CTGGCACTGC 19200 

CTACAACGCC CTGGCTCCCA AGGGTGCCCC AAATCCTTGC GAATGGGATG AAGCTGCTAC 19260 

TGCTCTTGAA ATAAACCTAG AAGAAGAGGA CGATGACAAC GAAGACGAAG TAGACGAGCA 19320 

AGCTGAGCAG CAAAAAACTC ACGTATTTGG GCAGGCGCCT TATTCTGGTA TAAATATTAC 19380 
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AAAGGAGGGT 


ATTCAAATAG 


GTGTCGAAGG 


TCAACCTGAA 


CCTCAAATAG 


GAGAATCTCA 


TGGGAGAGTC 


CTTAAAAAGA 


CTACCCCAAT 


CACAAATGAA 


AATGGAGGGC 


AAGGCATTCT 


TCAAGTGGAA 


ATGCAATTTT 


TCTCAACTAC 


GACTCCTAAA 


GTGGTATTGT 


ACAGTGAAGA 


TTCTTACATG 


CCCACTATTA 


AGGAAGGTAA 


GCCCAACAGG 


CCTAATTACA 


TTGCTTTTAG 


CAGCACGGGT 


AATATGGGTG 


TTCTGGCGGG 


TTTGCAAGAC 


AGAAACACAG 


AGCTTTCATA 


AACCAGGTAC 


TTTTCTATGT 


GGAATCAGGC 


TATTGAAAAT 


CATGGAACTG 


AAGATGAACT 


GATTAATACA 


GAGACTCTTA 


CCAAGGTAAA 


AAAAGATGCT 


ACAGAATTTT 


CAGATAAAAA 


GGAAATCAAT 


CTAAATGCCA 


ACCTGTGGAG 

• m^m ^m m* 


TTTGCCCGAC 


AAGCTAAAGT 


ACAGTCCTTC 


CTACGACTAC 


ATGAACAAGC 


GAGTGGTGGC 


TGGAGCACGC 


TGGTCCCTTG 


ACTATATGGA 


TGCTGGCCTG 


CGCTACCGCT 


CAATGTTGCT 


CCAGGTGCCT 


CAGAAGTTCT 


TTGCCATTAA 


CTACGAGTGG 


AACTTCAGGA 


AGGATGTTAA 


CCTAAGGGTT 


GACGGAGCCA 


GCATTAAGTT 


CCCCATGGCC 


CACAACACCG 


CCTCCACGCT 


CCAGTCCTTT 


AACGACTATC 


TCTCCGCCGC 


TACCAACGTG 


CCCATATCCA 


TCCCCTCCCG 


CACGCGCCTT 


AAGACTAAGG 


AAACCCCATC 


CTACTCTGGC 


TCTATACCCT 


ACCTAGATGG 


GGTGGCCATT 


ACCTTTGACT 


CTTCTGTCAG 


CAACGAGTTT 


GAAATTAAGC 


GCTCAGTTGA 


CATGACCAAA 


GACTGGTTCC 


TGGTACAAAT 


CTTCTATATC 


CCAGAGAGCT 


ACAAGGACCG 


L A 1 (jACjL v_GT 


CAGGTGGTGG 


ATGATACTAA 


ACACCAACAC 


AACAACTCTG 


GATTTGTTGG 


GGCCTACCCT 


GCTAACTTCC 


CCTATCCGCT 


CCAGAAAAAG 


TTTCTTTGCG 


ATCGCACCCT 
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TCAAACACCT 


AAATATGCCG 


ATAAAACATT 




GTGGTACGAA 


ACTGAAATTA 


ATCATGCAGC 




GAAACCATGT 


TACGGTTCAT 


ATGCAAAACC 




TGTAAAGCAA 


CAAAATGGAA 


AGCTAGAAAG 


X J O 4i U 


TGAGGCGACC 


GCAGGCAATG 


G TG AT AA CTT 




TGTAGATATA 


GAAACCCCAG 


ACACTCATAT 


X3 / tu 


CTCACGAGAA 


CTAATGGGCC 


AACAATCTAT 


1 3oUU 


GGACAATTTT 


ATTGGTCTAA 


TGTATTACAA 


x> _7 O O U 


CCAAGCATCG 


CAGTTGAATG 


CTGTTGTAGA 




CCAGCTTTTG 


CTTGATTCCA 


TTGGTGATAG 




TGTTGACAGC 


TATGATCCAG 


ATGTTAGAAT 


0 a n a n 


TCCAAATTAC 


TGCTTTCCAC 


TGGGAGGTGT 


"5 A T ft ft 


ACCTAAAACA 
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ATGGATGGGA 
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O ft "5 O ft 

^. U 4 Z 0 


AAATTTCCTG 
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T ft O D ft 

20280 


CAACGTAAAA 
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O ft ^ >1 ft 
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TCCCGGGTTA 


GTGGACTGCT 


ACATTAACCT 


ft y* ft f\ 

20400 


CAACGTCAAC 


CCATTTAACC 


ACCACCGCAA 


2U4oO 


GGGCAATGGT 


CGCTATGTGC 


CCTTCCACAT 


"5 ft C O ft 
2 U 32 0 


AAACCTCCTT 


CTCCTGCCGG 


GCTCATACAC 


0 ft C Q ft 


CATGGTTCTG 


CAGAGCTCCC 


TAGGAAATGA 


n ft /r A ft 
2 Ub4 U 


TGATAGCATT 


TGCCTTTACG 


CCACCTTCTT 


O ft *?ft ft 
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CTTAGAAACG 
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CAACATGCTC 


TACCCTATAC 
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ft 0 0 ft 
2 U o2 U 


CAACTGGGCG 
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GCTTTCCGCG 


GCTGGGCCTT 


"5 ft O O ft 
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GGCTACGACC 


CTTATTACAC 


T ft Q ft 

2 094 0 


AACCTTTTAC 


CTCAACCACA 


CCTTTAAGAA 


21000 


CTGGCCTGGC 


AATGACCGCC 
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O 1 ft C ft 
2XUO 0 


CGGGGAGGGT 


TACAACGTTG 


CCCAGTGTAA 


*5 "1 1 O ft 
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GCTAGCTAAC 


TACAACATTG 


GCTACCAGGG 


•5 1 1 Q ft 

2 1 1 0 U 


CATGTACTCC 


TTCTTTAGAA ACTTCCAGCC 


1 ^ A ft 

21240 


ATACAAGGAC 


TACCAACAGG 


TGGGCATCCT 


21300 


CTACCTTGCC 


CCCACCATGC 


GCGAAGGACA 


21360 


TATAGGCAAG 


ACCGCAGTTG 


ACAGCATTAC 


21420 


TTGGCGCATC 


CCATTCTCCA GTAACTTTAT 


21480 
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GTCCATGGGC GCACTCACAG ACCTGGGCCA AAACCTTCTC TACGCCAACT CCGCCCACGC 21540 
GCTAGACATG ACTTTTGAGG TGGATCCCAT GGACGAGCCC ACCCTTCTTT ATGTTTTGTT 21600 
TGAAGTCTTT GACGTGGTCC GTGTGCACCG GCCGCACCGC GGCGTCATCG AAACCGTGTA 21660 
CCTGCGCACG CCCTTCTCGG CCGGCAACGC CACAACATAA AGAAGCAAGC AACATCAACA 21720 
ACAGCTGCCG CCATGGGCTC CAGTGAGCAG GAACTGAAAG CCATTGTCAA AGATCTTGGT 21780 
TGTGGGCCAT ATTTTTTGGG CACCTATGAC AAGCGCTTTC CAGGCTTTGT TTCTCCACAC 21840 
AAGCTCGCCT GCGCCATAGT CAATACGGCC GGTCGCGAGA CTGGGGGCGT ACACTGGATG 
GCCTTTGCCT GGAACCCGCA CTCAAAAACA TGCTACCTCT TTGAGCCCTT TGGCTTTTCT 
GACCAGCGAC TCAAGCAGGT TTACCAGTTT GAGTACGAGT CACTCCTGCG CCGTAGCGCC 
ATTGCTTCTT CCCCCGACCG CTGTATAACG CTGGAAAAGT CCACCCAAAG CGTACAGGGG 
CCCAACTCGG CCGCCTGTGG ACTATTCTGC TGCATGTTTC TCCACGCCTT TGCCAACTGG 
CCCCAAACTC CCATGGATCA CAACCCCACC ATGAACCTTA TTACCGGGGT ACCCAACTCC 
ATGCTCAACA GTCCCCAGGT ACAGCCCACC CTGCGTCGCA ACCAGGAACA GCTCTACAGC 
TTCCTGGAGC GCCACTCGCC CTACTTCCGC AGCCACAGTG CGCAGATTAG GAGCGCCACT 
TCTTTTTGTC ACTTGAAAAA CATGTAAAAA TAATGTACTA GAGACACTTT CAATAAAGGC 
AAATGCTTTT ATTTGTACAC TCTCGGGTGA TTATTTACCC CCACCCTTGC CGTCTGCGCC 
GTTTAAAAAT CAAAGGGGTT CTGCCGCGCA TCGCTATGCG CCACTGGCAG GGACACGTTG 
CGATACTGGT GTTTAGTGCT CCACTTAAAC TCAGGCACAA CCATCCGCGG CAGCTCGGTG 
AAGTTTTCAC TCCACAGGCT GCGCACCATC ACCAACGCGT TTAGCAGGTC GGGCGCCGAT 
ATCTTGAAGT CGCAGTTGGG GCCTCCGCCC TGCGCGCGCG AGTTGCGATA CACAGGGTTG 
CAGCACTGGA ACACTATCAG CGCCGGGTGG TGCACGCTGG CCAGCACGCT CTTGTCGGAG 
ATCAGATCCG CGTCCAGGTC CTCCGCGTTG CTCAGGGCGA ACGGAGTCAA CTTTGGTAGC 
TGCCTTCCCA AAAAGGGCGC GTGCCCAGGC TTTGAGTTGC ACTCGCACCG TAGTGGCATC 
AAAAGGTGAC CGTGCCCGGT CTGGGCGTTA GGATACAGCG CCTGCATAAA AGCCTTGATC 
TGCTTAAAAG CCACCTGAGC CTTTGCGCCT TCAGAGAAGA ACATGCCGCA AGACTTGCCG 
GAAAACTGAT TGGCCGGACA GGCCGCGTCG TGCACGCAGC ACCTTGCGTC GGTGTTGGAG 
ATCTGCACCA CATTTCGGCC CCACCGGTTC TTCACGATCT TGGCCTTGCT AGACTGCTCC 
TTCAGCGCGC GCTGCCCGTT TTCGCTCGTC ACATCCATTT CAATCACGTG CTCCTTATTT 
ATCATAATGC TTCCGTGTAG ACACTTAAGC TCGCCTTCGA TCTCAGCGCA GCGGTGCAGC 
CACAACGCGC AGCCCGTGGG CTCGTGATGC TTGTAGGTCA CCTCTGCAAA CGACTGCAGG 
TACGCCTGCA GGAATCGCCC CATCATCGTC ACAAAGGTCT TGTTGCTGGT GAAGGTCAGC 
TGCAACCCGC GGTGCTCCTC GTTCAGCCAG GTCTTGCATA CGGCCGCCAG AGCTTCCACT 
TGGTCAGGCA GTAGTTTGAA GTTCGCCTTT AGATCGTTAT CCACGTGGTA CTTGTCCATC 
AGCGCGCGCG CAGCCTCCAT GCCCTTCTCC CACGCAGACA CGATCGGCAC ACTCAGCGGG 
TTCATCACCG TAATTTCACT TTCCGCTTCG CTGGGCTCTT CCTCTTCCTC TTGCGTCCGC 
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CTGCAGCTAG CCAACTACCT TGCCTACCAC TCTGACATAA TGGAAGACGT GAGCGGTGAC 25740 

GGTCTACTGG AGTGTCACTG TCGCTGCAAC CTATGCACCC CGCACCGCTC CCTGGTTTGC 2 5800 

AATTCGCAGC TGCTTAACGA AAGTCAAATT ATCGGTACCT TTGAGCTGCA GGGTCCCTCG 25860 

CCTGACGAAA AGTCCGCGGC TCCGGGGTTG AAACTCACTC CGGGGCTGTG GACGTCGGCT 25920 

TACCTTCGCA AATTTGTACC TGAGGACTAC CACGCCCACG AGATTAGGTT CTACGAAGAC 25980 

CAATCCCGCC CGCCAAATGC GGAGCTTACC GCCTGCGTCA TTACCCAGGG CCACATTCTT 26040 

GGCCAATTGC AAGCCATCAA CAAAGCCCGC CAAGAGTTTC TGCTACGAAA GGGACGGGGG 26100 

GTTTACTTGG ACCCCCAGTC CGGCGAGGAG CTCAACCCAA TCCCCCCGCC GCCGCAGCCC 26160 

TATCAGCAGC AGCCGCGGGC CCTTGCTTCC CAGGATGGCA CCCAAAAAGA AGCTGCAGCT 26220 

GCCGCCGCCA CCCACGGACG AGGAGGAATA CTGGGACAGT CAGGCAGAGG AGGTTTTGGA 26280 

CGAGGAGGAG GAGGACATGA TGGAAGACTG GGAGAGCCTA GACGAGGAAG CTTCCGAGGT 26340 

CGAAGAGGTG TCAGACGAAA CACCGTCACC CTCGGTCGCA TTCCCCTCGC CGGCGCCCCA 26400 

GAAATCGGCA ACCGGTTCCA GCATGGCTAC AACCTCCGCT CCTCAGGCGC CGCCGGCACT 26460 

GCCCGTTCGC CGACCCAACC GTAGATGGGA CACCACTGGA ACCAGGGCCG GTAAGTCCAA 26520 

GCAGCCGCCG CCGTTAGCCC AAGAGCAACA ACAGCGCCAA GGCTACCGCT CATGGCGCGG 26580 

GCACAAGAAC GCCATAGTTG CTTGCTTGCA AGACTGTGGG GGCAACATCT CCTTCGCCCG 26640 

CCGCTTTCTT CTCTACCATC ACGGCGTGGC CTTCCCCCGT AACATCCTGC ATTACTACCG 26700 

TCATCTCTAC AGCCCATACT GCACCGGCGG CAGCGGCAGC GGCAGCAACA GCAGCGGCCA 26760 

CACAGAAGCA AAGGCGACCG GATAGCAAGA CTCTGACAAA GCCCAAGAAA TCCACAGCGG 26820 

CGGCAGCAGC AGGAGGAGGA GCGCTGCGTC TGGCGCCCAA CGAACCCGTA TCGACCCGCG 26880 

AGCTTAGAAA CAGGATTTTT CCCACTCTGT ATGCTATATT TCAACAGAGC AGGGGCCAAG 26940 

AACAAGAGCT GAAAATAAAA AACAGGTCTC TGCGATCCCT CACCCGCAGC TGCCTGTATC 27000 

ACAAAAGCGA AGATCAGCTT CGGCGCACGC TGGAAGACGC GGAGGCTCTC TTCAGTAAAT 27060 

ACTGCGCGCT GACTCTTAAG GACTAGTTTC GCGCCCTTTC TCAAATTTAA GCGCGAAAAC 27120 

TACGTCATCT CCAGCGGCCA CACCCGGCGC CAGCACCTGT CGTCAGCGCC ATTATGAGCA 27180 

AGGAAATTCC CACGCCCTAC ATGTGGAGTT ACCAGCCACA AATGGGACTT GCGGCTGGAG 27240 

CTGCCCAAGA CTACTCAACC CGAATAAACT ACATGAGCGC GGGACCCCAC ATGATATCCC 27300 

GGGTCAACGG AATCCGCGCC CACCGAAACC GAATTCTCTT GGAACAGGCG GCTATTACCA 27360 

CCACACCTCG TAATAACCTT AATCCCCGTA GTTGGCCCGC TGCCCTGGTG TACCAGGAAA 27420 

GTCCCGCTCC CACCACTGTG GTACTTCCCA GAGACGCCCA GGCCGAAGTT CAGATGACTA 27480 

ACTCAGGGGC GCAGCTTGCG GGCGGCTTTC GTCACAGGGT GCGGTCGCCC GGGCAGGGTA 27540 

TAACTCACCT GACAATCAGA GGGCGAGGTA TTCAGCTCAA CGACGAGTCG GTGAGCTCCT 27600 

CGCTTGGTCT CCGTCCGGAC GGGACATTTC AGATCGGCGG CGCCGGCCGT CCTTCATTCA 27660 

CGCCTCGTCA GGCAATCCTA ACTCTGCAGA CCTCGTCCTC TGAGCCGCGC TCTGGAGGCA 27720 

TTGGAACTCT GCAATTTATT GAGGAGTTTG TGCCATCGGT CTACTTTAAC CCCTTCTCGG 2 7780 
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GACCTCCCGG CCACTATCCG GATCAATTTA TTCCTAACTT TGACGCGGTA AAGGACTCGG 27840 
CGGACGGCTA CGACTGAATG TTAAGTGGAG AGGCAGAGCA ACTGCGCCTG AAACACCTGG 27900 
TCCACTGTCG CCGCCACAAG TGCTTTGCCC GCGACTCCGG TGAGTTTTGC TACTTTGAAT 27960 
TGCCCGAGGA TCATATCGAG GGCCCGGCGC ACGGCGTCCG GCTTACCGCC CAGGGAGAGC 28020 
TTGCCCGTAG CCTGATTCGG GAGTTTACCC AGCGCCCCCT GCTAGTTGAG CGGGACAGGG 28080 
GACCCTG7GT TCTCACTGTG ATTTGCAACT GTCCTAACCT TGGATTACAT CAAGATCTTT 28140 
GTTGCCATCT CTGTGCTGAG TATAATAAAT ACAGAAATTA AAATATACTG GGGCTCCTAT 
CGCCATCCTG TAAACGCCAC CGTCTTCACC CGCCCAAGCA AACCAAGGCG AACCTTACCT 
GGTACTTTTA ACATCTCTCC CTCTGTGATT TACAACAGTT TCAACCCAGA CGGAGTGAGT 
CTACGAGAGA ACCTCTCCGA GCTCAGCTAC TCCATCAGAA AAAACACCAC CCTCCTTACC 
TGCCGGGAAC GTACGAGTGC GTCACCGGCC GCTGCACCAC ACCTACCGCC TGACCGTAAA 
CCAGACTTTT TCCGGACAGA CCTCAATAAC TCTGTTTACC AGAACAGGAG GTGAGCTTAG 
AAAACCCTTA GGGTATTAGG CCAAAGGCGC AGCTACTGTG GGGTTTATGA ACAATTCAAG 
CAACTCTACG GGCTATTCTA ATTCAGGTTT CTCTAGAATC GGGGTTGGGG TTATTCTCTG 
TCTTGTGATT CTCTTTATTC TTATACTAAC GCTTCTCTGC CTAAGGCTCG CCGCCTGCTG 
TGTGCACATT TGCATTTATT GTCAGCTTTT TAAACGCTGG GGTCGCCACC CAAGATGATT 
AGGTACATAA TCCTAGGTTT ACTCACCCTT GCGTCAGCCC ACGGTACCAC CCAAAAGGTG 
GATTTTAAGG AGCCAGCCTG TAATGTTACA TTCGCAGCTG AAGCTAATGA GTGCACCACT 
CTTATAAAAT GCACCACAGA ACATGAAAAG CTGCTTATTC GCCACAAAAA CAAAATTGGC 
AAGTATGCTG TTTATGCTAT TTGGCAGCCA GGTGACACTA CAGAGTATAA TGTTACAGTT 
TTCCAGGGTA AAAGTCATAA AACTTTTATG TATACTTTTC CATTTTATGA AATGTGCGAC 
ATTACCATGT ACATGAGCAA ACAGTATAAG TTGTGGCCCC CACAAAATTG TGTGGAAAAC 
ACTGGCACTT TCTGCTGCAC TGCTATGCTA ATTACAGTGC TCGCTTTGGT CTGTACCCTA 
CTCTATATTA AATACAAAAG CAGACGCAGC TTTATTGAGG AAAAGAAAAT GCCTTAATTT 
ACTAAGTTAC AAAGCTAATG TCACCACTAA CTGCTTTACT CGCTGCTTGC AAAACAAATT 
CAAAAAG7TA GCATTATAAT TAGAATAGGA TTTAAACCCC CCGGTCATTT CCTGCTCAAT 
ACCATTCCCC TGAACAATTG ACTCTATGTG GGATATGCTC CAGCGCTACA ACCTTGAAGT 
CAGGCTTCCT GGATGTCAGC ATCTGACTTT GGCCAGCACC TGTCCCGCGG ATTTGTTCCA 
GTCCAACTAC AGCGACCCAC CCTAACAGAG ATGACCAACA CAACCAACGC GGCCGCCGCT 
ACCGGACTTA CATCTACCAC AAATACACCC CAAGTTTCTG CCTTTGTCAA TAACTGGGAT 
AACTTGGGCA TGTGGTGGTT CTCCATAGCG CTTATGTTTG TATGCCTTAT TATTATGTGG 
CTCATCTGCT GCCTAAAGCG CAAACGCGCC CGACCACCCA TCTATAGTCC CATCATTGTG 
CTACACCCAA ACAATGATGG AATCCATAGA TTGGACGGAC TGAAACACAT GTTCTTTTCT 
CTTACAGTAT GATTAAATGA GACATGATTC CTCGAGTTTT TATATTACTG ACCCTTGTTG 
CGCTTTTTTG TGCGTGCTCC ACATTGGCTG CGGTTTCTCA CATCGAAGTA GACTGCATTC 29880 
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CAGCCTTCAC AGTCTATTTG CTTTACGGAT TTGTCACCCT CACGCTCATC TGCAGCCTCA 29940 
TCACTGTGGT CATCGCCTTT ATCCAGTGCA TTGACTGGGT CTGTGTGCGC TTTGCATATC 
TCAGACACCA TCCCCAGTAC AGGGACAGGA CTATAGCTGA GCTTCTTAGA ATTCTTTAAT 
TATGAAATTT ACTGTGACTT TTCTGCTGAT TATTTGCACC CTATCTGCGT TTTGTTCCCC 
GACCTCCAAG CCTCAAAGAC ATATATCATG CAGATTCACT CGTATATGGA ATATTCCAAG 
TTGCTACAAT GAAAAAAGCG ATCTTTCCGA AGCCTGGTTA TATGCAATCA TCTCTGTTAT 
GGTGTTCTGC AGTACCATCT TAGCCCTAGC TATATATCCC TACCTTGACA TTGGCTGGAA 
ACGAATAGAT GCCATGAACC ACCCAACTTT CCCCGCGCCC GCTATGCTTC CACTGCAACA 
AGTTGTTGCC GGCGGCTTTG TCCCAGCCAA TCAGCCTCGC CCCACTTCTC CCACCCCCAC 
TGAAATCAGC TACTTTAATC TAACAGGAGG AGATGACTGA CACCCTAGAT CTAGAAATGG 
ACGGAATTAT TACAGAGCAG CGCCTGCTAG AAAGACGCAG GGCAGCGGCC GAGCAACAGC 30540 
GCATGAATCA AGAGCTCCAA GACATGGTTA ACTTGCACCA GTGCAAAAGG GGTATCTTTT 30600 
GTCTGGTAAA GCAGGCCAAA GTCACCTACG ACAGTAATAC CACCGGACAC CGCCTTAGCT 30660 
ACAAGTTGCC AACCAAGCGT CAGAAATTGG TGGTCATGGT GGGAGAAAAG CCCATTACCA 30720 
TAACTCAGCA CTCGGTAGAA ACCGAAGGCT GCATTCACTC ACCTTGTCAA GGACCTGAGG 30780 
ATCTCTGCAC CCTTATTAAG ACCCTGTGCG GTCTCAAAGA TCTTATTCCC TTTAACTAAT 30840 
AAAAAAAAAT AATAAAGCAT CACTTACTTA AAATCAGTTA GCAAATTTCT GTCCAGTTTA 30900 
TTCAGCAGCA CCTCCTTGCC CTCCTCCCAG CTCTGGTATT GCAGCTTCCT CCTGGCTGCA 30960 
AACTTTCTCC ACAATCTAAA TGGAATGTCA GTTTCCTCCT GTTCCTGTCC ATCCGCACCC 31020 
ACTATCTTCA TGTTGTTGCA GATGAAGCGC GCAAGACCGT CTGAAGATAC CTTCAACCCC 31080 
GTGTATCCAT ATGACACGGA AACCGGTCCT CCAACTGTGC CTTTTCTTAC TCCTCCCTTT 31140 
GTATCCCCCA ATGGGTTTCA AGAGAGTCCC CCTGGGGTAC TCTCTTTGCG CCTATCCGAA 31200 
CCTCTAGTTA CCTCCAATGG CATGCTTGCG CTCAAAATGG GCAACGGCCT CTCTCTGGAC 31260 
GAGGCCGGCA ACCTTACCTC CCAAAATGTA ACCACTGTGA GCCCACCTCT CAAAAAAACC 31320 
AAGTCAAACA TAAACCTGGA AATATCTGCA CCCCTCACAG TTACCTCAGA AGCCCTAACT 31380 
GTGGCTGCCG CCGCACCTCT AATGGTCGCG GGCAACACAC TCACCATGCA ATCACAGGCC 31440 
CCGCTAACCG TGCACGACTC CAAACTTAGC ATTGCCACCC AAGGACCCCT CACAGTGTCA 31500 
GAAGGAAAGC TAGCCCTGCA AACATCAGGC CCCCTCACCA CCACCGATAG CAGTACCCTT 31560 
ACTATCACTG CCTCACCCCC TCTAACTACT GCCACTGGTA GCTTGGGCAT TGACTTGAAA 
GAGCCCATTT ATACACAAAA TGGAAAACTA GGACTAAAGT ACGGGGCTCC TTTGCATGTA 
ACAGACGACC TAAACACTTT GACCGTAGCA ACTGGTCCAG GTGTGACTAT TAATAATACT 
TCCTTGCAAA CTAAAGTTAC TGGAGCCTTG GGTTTTGATT CACAAGGCAA TATGCAACTT 
AATGTAGCAG GAGGACTAAG GATTGATTCT CAAAACAGAC GCCTTATACT TGATGTTAGT 
TATCCGTTTG ATGCTCAAAA CCAACTAAAT CTAAGACTAG GACAGGGCCC TCTTTTTATA 
AACTCAGCCC ACAACTTGGA TATTAACTAC AACAAAGGCC TTTACTTGTT TACAGCTTCA 31980 
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CCATGGATCA 


TCATGCTCGT 


tv f*/**tr\r* tv m tv 

CACGTGCATA 


CACTTCCTCA 


GGATTACAAG 


A A /"* T\ ftOn/Urn 

AAC AAC CC AT 


TCCTGAATCA 


GCGTAAATCC 


CACGTTGTGC 


ATTGTCAAAG 


TGTTACATTC 


AGCGCGGGTT 


TCTGTCTCAA 


AAGGAGGTAG 


CAACCGAGAT 


CGTGTTGGTC 


GTAGTGTCAT 
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rtuurtU 1 vj\w V e\ 


HvjoGG 1 1 LA r 


GTTTGACGCT 


32040 




Ui i LrtLL 1 AA 


T 1 *^* TV TV TV TV 

TGCACCAAAC 


32100 


UUv, V» 1 rtOrtrt 1 


TTP A TTP AAA 
1 iuAl 1LAAA 


^"9 TV 9k /^9 f"t F99 9k #99^9 

C AAGG C T ATG 


32160 


nArAHPAPAr; 


uluLLAi 1AL 


TV *TT 9k ^9 TV TV TV <*^9J 

AGTAGGAAAC 


32220 


/\V,*Hv_V_MOl_ 1 v^ 


LA1L ILL IAA 


ptrn^i rn 9, M9 « M #P99 9Jk 

CTGTAGACTA 


32280 




AA I GT, GGC AG 


*99^"9 9k 9k 9V P99 9k ^9*99*99 

TCAAATACTT 


32340 


M.O ill vjvjL 1 C 


r* A AT»A fr^ /"<» pr*i ^ 

CAATATCTGG 


TV TV Tv /9M«pn /9 « 9k 

AACAGTTCAA 


32400 


rtrt i vjVj/Vvs i GC 


rp A OT AAA fl A TV 

TACTAAACAA 


^9^ t^^9 ^^9 *^*# #99 ^9 #■« 

TTCCTTCCTG 


32460 


uAlLl 1ACTG 


A A^"'^"»y"*A/'»A/"»r , i 

AAGG C AC AG C 


#^9 f^H 9k ^T79 9ft 

CTATACAAAC 


32520 


TftTOPR A A A T* 

lAICCAAAAr 


CTCACGGTAA 


9k 9k ^9J #99 *^ ^91 mW-m _ 

AACTGCCAAA 


32580 


a A r* A TV TV A 

GGAGACAAAA 


Am 9, 9k 9k ^9 ^9 -T99 t99 

CTAAACCTGT 


9k 4ft ^9 ^NB — — - 

AACACTAACC 


32640 


PSPRPR A PTP 

(jALAUAAL 1 C 

• 


^♦A A /Tt"'/""! /**t TV rri tv 

C AAGTG CAT A 


CTCTATGTCA 


32700 


A 1 I'M ' A A TV"* A A A 

A J. I AA1GAAA 


rn tv ^nHHm /^9 » /^t 

TATTTGCCAC 


9k j — | i—j J~#~t _r~i_ jwb 

ATCCTCTTAC 


32760 


AlLbl X lulu 


TTATGTTTCA 


TV ^9^in%i*9 MM fV«#«ft 

ACGTGTTTAT 


32820 


1 I LA i 1 L AGT 


TV /^Jfl^TV "IT TV /™i <*9/^9/™T 

AGTATAG CCC 


TV ^*9 #*9J 9k ^91 <m • 

CACCACCACA 


32880 


AAL 1 LAL AGA 


ACCCTAGTAT 


#99^9>9k 9k <#9ft #99 ^kM 

TCAACCTGCC 


32940 


CCTTTCTCCC 


CGGCTGGCCT 


T AAAAAG CAT 


33000 


^f^^^ "^*TTTV ^Jrt TA ^T94^n ^% 

TGTTATATTC 


9k ^9 9v /k]/« •TMMfHMk 

CACACGGTTT 


CCTGTCGAGC 


33060 


CCCGGGCAGC 


*TT «^9 9V nrTrtrl 9k 9k ^9 m 

TCACTTAAGT 


TCATGTCGCT 


33120 


AAC TTG CGG T 


TGCTTAACGG 


GCGGCGAAGG 


33180 


TV TV Tt m^l^nV/^/'i 

AT AAT CG TG C 


9k f*99 ,*T^9 9k ^9 X9l 9k #99 *M 

ATCAGGATAG 


GGCGGTGGTG 


33240 


CCGCCGCTCC 


r9>9 P99 ^9: ^9 9k ^91^*9 

GTCCTGCAGG 


AATACAACAT 


33300 


CGCCCGCAGC 


ATAAGGCGCC 


TTGTCCTCCG 


33360 


TV Prt 9k ^9J 9k ^9 9k ***m 

ATCAGCACAG 


#99 9k 9k MMt >(9ft ^9t v m 

TAACTGCAGC 


ACAGCACCAC 


33420 


^9i91 ^9 m 9k fff ^9 ^9 9% 

GCTGTATCCA 


AAGCTCATGG 


CGGGGACCAC 


33480 


CAGGTAGATT 


TV 9V **9l #99 a«9> «*9J ^9t^9> 9\ >9 

AAGTGGCGAC 


^9 ^9 <T#9 #J1 ^m. ___ „=. 

CCCTCATAAA 


33540 


A rw/1»T , 'T^/-*'n A TV 

CATGTTGTAA 


4 I iril^ TV 9k /9^im| 

TTCACCACCT 


#*9 #^9 ##9 «T9I dT9j wwm 9k _r_L 

CCCGGTACCA 


33600 


LAL LAL LA 1 L 


^fTT A TV TV #™1 TV /^i ^ 

CTAAACCAGC 


rT9^9j j#9J #*9 «T9j 9k 9k 9k 9k <#9i 

TGGCCAAAAC 


33660 


GGGACx GGAA 


A A A y*^ TV rn 

CAATGACAGT 


y^k ^9 9k j#9j 9k ^91 #*9 <T9i ^9 9ft 

GGAGAGCCCA 


33720 


/"» A r P/""* ATA A 

LATGATATCA 


A T<"' ' I'll tv yi 

ATGTTGGCAC 


TV TV i*T 9k j#9j «m <#v ^9 ^9 

AACACAGGCA 


33780 


CTCCTCCCGC 


fT^f99 9k 9k 9V #^9 9k 

GTTAGAACCA 


TATCCCAGGG 


33840 


CACACTGCAG 


GGAAGACCTC 


GCACGTAACT 


33900 


GGGCAGCAGC 


GGATGATCCT 


CCAGTATGGT 


33960 


ACGATCCCTA 


CTGTACGGAG 


TGCGCCGAGA 


34020 


GCCAAATGGA 


ACGCCGGACG 


TAGTCATATT 


34080 
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TCCTGAAGCA AAACCAGGTG CGGGCGTGAC AAACAGATCT GCGTCTCCGG TCTCGCCGCT 34140 
TAGATCGCTC TGTGTAGTAG TTGTAGTATA TCCACTCTCT CAAAGCATCC AGGCGCCCCC 34200 



TGGCT7CGGG TTCTATGTAA ACTCCTTCAT GCGCCGCTGC CCTGATAACA TCCACCACCG 
CAGAATAAGC CACACCCAGC CAACCTACAC ATTCGTTCTG CGAGTCACAC ACGGGAGGAG 
CGGGAAGAGC TGGAAGAACC ATGTTTTTTT TTTTATTCCA AAAGATTATC CAAAACCTCA 
AAATGAAGAT CTATTAAGTG AACGCGCTCC CCTCCGGTGG CGTGGTCAAA CTCTACAGCC 
AAAGAACAGA TAATGGCATT TGTAAGATGT TGCACAATGG CTTCCAAAAG GCAAACGGCC 
CTCACGTCCA AGTGGACGTA AAGGCTAAAC CCTTCAGGGT GAATCTCCTC TATAAACATT 
CCAGCACCTT CAACCATGCC CAAATAATTC TCATCTCGCC ACCTTCTCAA TATATCTCTA 
AGCAAATCCC GAATATTAAG TCCGGCCATT GTAAAAATCT GCTCCAGAGC GCCCTCCACC 
TTCAGCCTCA AGCAGCGAAT CATGATTGCA AAAATTCAGG TTCCTCACAG ACCTGTATAA 
GATTCAAAAG CGGAACATTA ACAAAAATAC CGCGATCCCG TAGGTCCCTT CGCAGGGCCA 
GCTGAACATA ATCGTGCAGG TCTGCACGGA CCAGCGCGGC CACTTCCCCG CCAGGAACCT 
TGACAAAAGA ACCCACACTG ATTATGACAC GCATACTCGG AGCTATGCTA ACCAGCGTAG 
CCCCGATGTA AGCTTTGTTG CATGGGCGGC GATATAAAAT GCAAGGTGCT GCTCAAAAAA 
TCAGGCAAAG CCTCGCGCAA AAAAGAAAGC ACATCGTAGT CATGCTCATG CAGATAAAGG 
CAGGTAAGCT CCGGAACCAC CACAGAAAAA GACACCATTT TTCTCTCAAA CATGTCTGCG 
GGTTTCTGCA TAAACACAAA ATAAAATAAC AAAAAAACAT TTAAACATTA GAAGCCTGTC 
TTACAACAGG AAAAACAACC CTTATAAGCA TAAGACGGAC TACGGCCATG CCGGCGTGAC 
CGTAAAAAAA CTGGTCACCG TGATTAAAAA GCACCACCGA CAGCTCCTCG GTCATGTCCG 
GAGTCATAAT GTAAGACTCG GTAAACACAT CAGGTTGATT CATCGGTCAG TGCTAAAAAG 
CGACCGAAAT AGCCCGGGGG AATACATACC CGCAGGCGTA GAGACAACAT TACAGCCCCC 
ATAGGAGGTA TAACAAAATT AATAGGAGAG AAAAACACAT AAACACCTGA AAAACCCTCC 
TGCCTAGGCA AAATAGCACC CTCCCGCTCC AGAACAACAT ACAGCGCTTC ACAGCGGCAG 
CCTAACAGTC AGCCTTACCA GTAAAAAAGA AAACCTATTA AAAAAACACC ACTCGACACG 
GCACCAGCTC AATCAGTCAC AGTGTAAAAA AGGGCCAAGT GCAGAGCGAG TATATATAGG 
ACTAAAAAAT GACGTAACGG TTAAAGTCCA CAAAAAACAC CCAGAAAACC GCACGCGAAC 
CTACGCCCAG AAACGAAAGC CAAAAAACCC ACAACTTCCT CAAATCGTCA CTTCCGTTTT 
CCCACGTTAC GTAACTTCCC ATTTTAAGAA AACTACAATT CCCAACACAT ACAAGTTACT 
CCGCCCTAAA ACCTACGTCA CCCGCCCCGT TCCCACGCCC CGCGCCACGT CACAAACTCC 
ACCCCCTCAT TATCATATTG GCTTCAATCC AAAATAAGGT ATATTATTGA TGATG 
(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



34260 

34320 

34380 

34440 

34500 

34560 

34620 

34680 

34740 

34800 

34860 

34920 

34980 

35040 

35100 

35160 

35220 

35280 

35340 

35400 

35460 

35520 

35580 

35640 

35700 

35760 

35820 

35880 

35935 
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(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = 11 DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

TTCATTTTAT GTTTCAGGTT CAGGG 2 , 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

TTACCGCCAC ACTCGCAGGG 2C 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34303 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



TTATTTTGGA 


TTGAAGCCAA 


TATGATAATG 


AGGGGGTGGA 


GTTTGTGACG 


TGGCGCGGGG 


60 


CGTGGGAACG 


GGGCGGGTGA 


CGTAGTAGTG 


TGGCGGAAGT 


GTGATGTTGC 


AAGTGTGGCG 


120 


GAACACATGT 


AAGCGACGGA 


TGTGGCAAAA 


GTGACGTTTT 


TGGTGTGCGC 


CGGATCCACA 


180 


GGACGGGTGT 


GGTCGCCATG 


ATCGCGTAGT 


CGATAGTGGC 


TCCAAGTAGC 


GAAGCGAGCA 


240 


GGACTGGGCG 


GCGGCCAAAG 


CGGTCGGACA 


GTGCTCCGAG 


AACGGGTGCG 


CATAGAAATT 


300 


GCATCAACGC 


ATATAGCGCT 


AGCAGCACGC 


CATAGTGACT 


GGCGATGCTG 


TCGGAATGGA 


360 


CGATATCCCG 


CAAGAGGCCC 


GGCAGTACCG 


GCATAACCAA 


GCCTATGCCT 


ACAGCATCCA 


420 


GGGTGACGGT 


GCCGAGGATG 


ACGATGAGCG 


CATTGTTAGA 


TTTCATACAC 


GGTGCCTGAC 


480 


TGCGTTAGCA 


ATTTAACTGT 


GATAAACTAC 


CGCATTAAAG 


CTTATCGATG 


ATAAGCTGTC 


540 


AAACATGAGA 


ATTCTTGAAG 


ACGAAAGGGC 


CTCGTGATAC 


GCCTATTTTT 


ATAGGTTAAT 


600 


GTCATGATAA 


TAATGGTTTC 


TTAGACGTCA 


GGTGGCACTT 


TTCGGGGAAA 


TGTGCGCGGA 


660 


ACCCCTATTT 


GTTTATTTTT 


CTAAATACAT 


TCAAATATGT 


ATCCGCTCAT 


GAGACAATAA 


720 


CCCTGATAAA 


TGCTTCAATA 


ATATTGAAAA 


AGGAAGAGTA 


TGAGTATTCA 


ACATTTCCGT 


780 


GTCGCCCTTA 


TTCCCTTTTT 


TGCGGCATTT 


TGCCTTCCTG 


TTTTTGCTCA 


CCCAGAAACG 


840 


CTGGTGAAAG 


TAAAAGATGC 


TGAAGATCAG 


TTGGGTGCAC 


GAGTGGGTTA 


CATCGAACTG 


900 


GATCTCAACA 


GCGGTAAGAT 


CCTTGAGAGT 


TTTCGCCCCG 


AAGAACGTTT 


TCCAATGATG 


960 


AGCACTTTTA 


AAGTTCTGCT 


ATGTGGCGCG 


GTATTATCCC 


GTGTTGACGC 


CGGGCAAGAG 


1020 


CAACTCGGTC 


GCCGCATACA 


CTATTCTCAG 


AATGACTTGG 


TTGAGTACTC 


ACCAGTCACA 


1080 
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GAAAAGCATC TTACGGATGG CATGACAGTA AGAGAATTAT GCAGTGCTGC CATAACCATG 114 0 

AGTGATAACA CTGCGGCCAA CTTACTTCTG ACAACGATCG GAGGACCGAA GGAGCTAACC 1200 

GCTTTTTTGC ACAACATGGG GGATCATGTA ACTCGCCTTG ATCGTTGGGA ACCGGAGCTG 1260 

AATGAAGCCA TACCAAACGA CGAGCGTGAC ACCACGATGC CTGCAGCAAT GGCAACAACG 1320 

TTGCGCAAAC TATTAACTGG CGAACTACTT ACTCTAGCTT CCCGGCAACA ATTAATAGAC 1380 

TGGATGGAGG CGGATAAAGT TGCAGGACCA CTTCTGCGCT CGGCCCTTCC GGCTGGCTGG 1440 

TTTATTGCTG ATAAATCTGG AGCCGGTGAG CGTGGGTCTC GCGGTATCAT TGCAGCACTG 1500 

GGGCCAGATG GTAAGCCCTC CCGTATCGTA GTTATCTACA CGACGGGGAG TCAGGCAACT 1560 

ATGGATGAAC GAAATAGACA GATCGCTGAG ATAGGTGCCT CACTGATTAA GCATTGGTAA 1620 

CTGTCAGACC AAGTTTACTC ATATATACTT TAGATTGATT TAAAACTTCA TTTTTAATTT 1680 

AAAAGGATCT AGGTGAAGAT CCTTTTTGAT AATCTCATGA CCAAAATCCC TTAACGTGAG 1740 

TTTTCGTTCC ACTGAGCGTC AGACCCCGTA GAAAAGATCA AAGGATCTTC TTGAGATCCT 1800 

TTTTTTCTGC GCGTAATCTG CTGCTTGCAA ACAAAAAAAC CACCGCTACC AGCGGTGGTT 1860 

TGTTTGCCGG ATCAAGAGCT ACCAACTCTT TTTCCGAAGG TAACTGGCTT CAGCAGAGCG 1920 

CAGATACCAA ATACTGTCCT TCTAGTGTAG CCGTAGTTAG GCCACCACTT CAAGAACTCT 1980 

GTAGCACCGC CTACATACCT CGCTCTGCTA ATCCTGTTAC CAGTGGCTGC TGCCAGTGGC 2040 

GATAAGTCGT GTCTTACCGG GTTGGACTCA AGACGATAGT TACCGGATAA GGCGCAGCGG 2100 
TCGGGCTGAA CGGGGGGTTC GTGCACACAG CCCAGCTTGG AGCGAACGAC CTACACCGAA 2160 
CTGAGATACC TACAGCGTGA GCATTGAGAA AGCGCCACGC TTCCCGAAGG GAGAAAGGCG 2220 
GACAGGTATC CGGTAAGCGG CAGGGTCGGA ACAGGAGAGC GCACGAGGGA GCTTCCAGGG 2280 
GGAAACGCCT GGTATCTTTA TAGTCCTGTC GGGTTTCGCC ACCTCTGACT TGAGCGTCGA 2340 
TTTTTGTGAT GCTCGTCAGG GGGGCGGAGC CTATGGAAAA ACGCCAGCAA CGCGGCCTTT 2400 
TTACGGTTCC TGGCCTTTTG CTGGCCTTTT GCTCACATGT TCTTTCCTGC GTTATCCCCT 2460 
GATTCTGTGG ATAACCGTAT TACCGCCTTT GAGTGAGCTG ATACCGCTCG CCGCAGCCGA 252 0 
ACGACCGAGC GCAGCGAGTC AGTGAGCGAG GAAGCGGAAG AGCGCCTGAT GCGGTATTTT 2 580 
CTCCTTACGC ATCTGTGCGG TATTTCACAC CGCATATGGT GCACTCTCAG TACAATCTGC 2640 
TCTGATGCCG CATAGTTAAG CCAGTATACA CTCCGCTATC GCTACGTGAC TGGGTCATGG 2700 
CTGCGCCCCG ACACCCGCCA ACACCCGCTG ACGCGCCCTG ACGGGCTTGT CTGCTCCCGG 2760 
CATCCGCTTA CAGACAAGCT GTGACCGTCT CCGGGAGCTG CATGTGTCAG AGGTTTTCAC 2820 
CGTCATCACC GAAACGCGCG AGGCAGTCTA GACAATAGTA GTACGGATAG CTGTGACTCC 2880 
GGTCCTTCTA ACACACCTCC TGAGATACAC CCGGTGGTCC CGCTGTGCCC CATTAAACCA 2940 
GTTGCCGTGA GAGTTGGTGG GCGTCGCCAG GCTGTGGAAT GTATCGAGGA CTTGCTTAAC 3000 
GAGCCTGGGC AACCTTTGGA CTTGAGCTGT AAACGCCCCA GGCCATAAGG TGTAAACCTG 3060 
TGATTGCGTG TGTGGTTAAC GCCTTTGTTT GCTGAATGAG TTGATGTAAG TTTAATAAAG 3120 
GGTGAGATAA TGTTTAACTT GCATGGCGTG TTAAATGGGG CGGGGCTTAA AGGGTATATA 3180 
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ATPPP **omrir* 


PPPT'TV T« WOmT 

GGCTAATCTT 


GGTTACATCT 


GACCTCATGG 


AGGCTTGGGA GTGTTTGGAA 


3240 


<"2 A T , T ,r P""'»p»Tip 
On I 1 1 - * V_ 1 Vj 


P'T'PTPPP'T 1 A TV 

CTGTGCGTAA 


CTTGCTGGAA 


CAGAGCTCTA 


ACAGTACCTC 


TTGGTTTTGG 


3300 


Rr!^ , l"l"i , n i PPT 

•tt\JVJ 1 1 A i(j 1 


GGGGCTCATC 


CCAGGCAAAG 


TTAGTCTGCA 


GAATTAAGGA 


GGATTAGAAG 


3360 


TP.PP A T"T ,r PP 


AAGAGCTTTT 


^1 T\ TV TV m» ,mft«M «k 

GAAATCCTGT 


GGTGAGCTGT 


TTGATTCTTT 


GAATCTGGGT 


3420 


V- Au C AVjvs CGC 


TTTTCCAAGA 


GAAGGTCATC 


AAGACTTTGG 


ATTTTTCCAC 


ACCGGGGCGC 


3480 


/■<• ptp p c p pt p 
GC1GCGGC IG 


CTGTTGCTTT 


TTTGAGTTTT 


ATAAAGGATA 


AATGGAGCGA AGAAACCCAT 


3540 


C TGAG CGGGG 


GGTACCTGCT 


GGATTTTCTG 


GCCATGCATC 


TP TP P A P A PP 

I GTGGAG AGC 


GGTTGTGAGA 


3600 


P TV P TV TV P T» t> tyy r+ 

C ACAAG AAT C 


GCCTGCTACT 


GTTGTCTTCC 


GTCCGCCCGG 


CGATAATACC 


GACGGAGGAG 


3660 


CAGCAGCAGC 


AGCAGGAGGA 


AGCCAGGCGG 


CGGCGGCAGG 


AGCAGAGCCC 


ATGGAACCCG 


3720 


AGAGC cggcc 


TGGACCCTCG 


GGAATGAATG 


TTGTACAGGT 


GGCTGAACTG 


TATCCAGAAC 


3780 


TP A P A pp p a t 

I GAGACGCAT 


TTTGACAATT 


ACAGAGGATG 


GGCAGGGGCT 


AAAGGGGGTA 


AAGAGGGAGC 


3840 


GGGGGGCTTG 


TGAGGCTACA 


GAGGAGGCTA 


GGAATCTAGC 


TTTTAGCTTA 


ATGACCAGAC 


3900 


ALLvj 1CCTGA 


GTGTATTACT 


TTTCAACAGA 


TCAAGGATAA 


TTGCGCTAAT 


GAGCTTGATC 


3960 


TP P TP P P 7V 

1 GC I GGCGCA 


T\ T% /*"T f VI TV ^ffi j*>a 

GAAGTATTCC 


ATAGAGCAGC 


TGACCACTTA 


CTGGCTGCAG 


CCAGGGGATG 


4020 


7V TTT'Tr' TV tv 

Al 1 1 IGAGGA 


GGCTATTAGG 


GTATATGCAA 


AGGTGGCACT 


TAGGCCAGAT 


TGCAAGTACA 


4080 


APATPi\r"P1\ a 

AGA 1 CAGCAA 


ACTTGTAAAT 


ATCAGGAATT 


GTTGCTACAT 


TTCTGGGAAC 


GGGGCCGAGG 


4140 


ifpp A P T* A /"» TV 

rGGAG^iTAGA 


TACGGAGGAT 


AGGGTGGCCT 


TTAGATGTAG 


CATGATAAAT ATGTGGCCGG 


4200 


GGGTGCTTGG 


CATGGACGGG 


GTGGTTATTA 


TGAATGTAAG 


GTTTACTGGC 


CCCAATTTTA 


4260 


GCGGTACGGT 


TTTCCTGGCC 


AATACCAACC 


TTATCCTACA 


CGGTGTAAGC 


TTCTATGGGT 


4320 


TTAACAATAC 


CTGTGTGGAA 


GCCTGGACCG 


ATGTAAGGGT 


TCGGGGCTGT 


GCCTTTTACT 


4380 


gctgctggaa 


GGGGGTGGTG 


TGTCGCCCCA 


AAAGCAGGGC 


TTCAATTAAG 


AAATGCCTCT 


4440 


TTGAAAGGTG 


TACCTTGGGT 


ATCCTGTCTG 


AGGGTAACTC 


CAGGGTGCGC 


CACAATGTGG 


4500 


CCTCCGACTG 


TGGTTGCTTC 


ATGCTAGTGA 


AAAGCGTGGC 


TGTGATTAAG 


CATAACATGG 


4560 


tatgtggcaa 


CTGCGAGGAC 


AGGGCCTCTC 


AGATGCTGAC 


CTGCTCGGAC 


GGCAACTGTC 


4620 


ACCTGCTGAA 


GACCATTCAC 


GTAGCCAGCC 


ACTCTCGCAA 


GGCCTGGCCA 


GTGTTTGAGC 


4680 


7\ rp A A P A A /*"T1 

A 1 AAC AT ACT 


GACCCGCTGT 


TCCTTGCATT 


TGGGTAACAG 


GAGGGGGGTG 


TTCCTACCTT 


4740 


A P P A A TP TV TV 

AC C AATGC AA 


TTTGAGTCAC 


ACTAAGATAT 


TGCTTGAGCC 


CGAGAGCATG 


TCCAAGGTGA 


4800 


A PPTP A A PP/"^ 

ACL I GAACGG 


GGTGTTTGAC 


ATGACCATGA 


AGATCTGGAA 


GGTGCTGAGG 


TACGATGAGA 


4860 


f^f^f^ A P P TV 

CCCGCACCAG 


GTGCAGACCC 


TGCGAGTGTG 


GCGGTAAACA 


TATTAGGAAC 


CAGCCTGTGA 


4920 


TGCTGGATGT 


GACCGAGGAG 


CTGAGGCCCG 


ATCACTTGGT 


GCTGGCCTGC 


ACCCGCGCTG 


4980 


AGTTTGGCTC 


TAGCGATGAA 


GATACAGATT 


GAGGTACTGA 


AATGTGTGGG 


CGTGGCTTAA 


5040 


PPPTPP^A A A 

VjVjVj a GGoAAA 


GAATATATAA 


GGTGGGGGTC 


TTATGTAGTT 


TTGTATCTGT 


TTTGCAGCAG 


5100 


CCGCCGCCGC 


CATGAGCACC 


AACTCGTTTG 


ATGGAAGCAT 


TGTGAGCTCA 


TATTTGACAA 


5160 


CGCGCATGCC 


CCCATGGGCC 


GGGGTGCGTC 


AGAATGTGAT 


GGGCTCCAGC 


ATTGATGGTC 


5220 


GCCCCGTCCT 


GCCCGCAAAC 


TCTACTACCT 


TGACCTACGA 


GACCGTGTCT 


GGAACGCCGT 


5280 
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TGGAGACTGC AGCCTCCGCC GCCGCTTCAG CCGCTGCAGC CACCGCCCGC GGGATTGTGA 534 0 

CTGACTTTGC TTTCCTGAGC CCGCTTGCAA GCAGTGCAGC TTCCCGTTCA TCCGCCCGCG 5400 

ATGACAAGTT GACGGCTCTT TTGGCACAAT TGGATTCTTT GACCCGGGAA CTTAATGTCG 5460 

TTTCTCAGCA GCTGTTGGAT CTGCGCCAGC AGGTTTCTGC CCTGAAGGCT TCCTCCCCTC 552 0 

CCAATGCGGT TTAAAACATA AATAAAAAAC CAGACTCTGT TTGGATTTGG ATCAAGCAAG 5580 

TGTCTTGCTG TCTTTATTTA GGGGTTTTGC GCGCGCGGTA GGCCCGGGAC CAGCGGTCTC 5640 

GGTCGTTGAG GGTCCTGTGT ATTTTTTCCA GGACGTGGTA AAGGTGACTC TGGATGTTCA 5700 

GATACATGGG CATAAGCCCG TCTCTGGGGT GGAGGTAGCA CCACTGCAGA GCTTCATGCT 5760 

GCGGGGTGGT GTTGTAGATG ATCCAGTCGT AGCAGGAGCG CTGGGCGTGG TGCCTAAAAA 5820 

TGTCTTTCAG TAGCAAGCTG ATTGCCAGGG GCAGGCCCTT GGTGTAAGTG TTTACAAAGC 5880 

GGTTAAGCTG GGATGGGTGC ATACGTGGGG ATATGAGATG CATCTTGGAC TGTATTTTTA 5940 

GGTTGGCTAT GTTCCCAGCC ATATCCCTCC GGGGATTCAT GTTGTGCAGA ACCACCAGCA 6000 

CAGTGTATCC GGTGCACTTG GGAAATTTGT CATGTAGCTT AGAAGGAAAT GCGTGGAAGA 6060 

ACTTGGAGAC GCCCTTGTGA CCTCCAAGAT TTTCCATGCA TTCGTCCATA ATGATGGCAA 6120 

TGGGCCCACG GGCGGCGGCC TGGGCGAAGA TATTTCTGGG ATCACTAACG TCATAGTTGT 6180 

GTTCCAGGAT GAGATCGTCA TAGGCCATTT TTACAAAGCG CGGGCGGAGG GTGCCAGACT 6240 

GCGGTATAAT GGTTCCATCC GGCCCAGGGG CGTAGTTACC CTCACAGATT TGCATTTCCC 63 00 

ACGCTTTGAG TTCAGATGGG GGGATCATGT CTACCTGCGG GGCGATGAAG AAAACGGTTT 6360 

CCGGGGTAGG GGAGATCAGC TGGGAAGAAA GCAGGTTCCT GAGCAGCTGC GACTTACCGC 6420 

AGCCGGTGGG CCCGTAAATC ACACCTATTA CCGGGTGCAA CTGGTAGTTA AGAGAGCTGC 6480 

AGCTGCCGTC ATCCCTGAGC AGGGGGGCCA CTTCGTTAAG CATGTCCCTG ACTCGCATGT 6540 
TTTCCCTGAC CAAATCCGCC AGAAGGCGCT CGCCGCCCAG CGATAGCAGT TCTTGCAAGG 6600 
AAGCAAAGTT TTTCAACGGT TTGAGACCGT CCGCCGTAGG CATGCTTTTG AGCGTTTGAC 6660 
CAAGCAGTTC CAGGCGGTCC CACAGCTCGG TCACCTGCTC TACGGCATCT CGATCCAGCA 6720 
TATCTCCTCG TTTCGCGGGT TGGGGCGGCT TTCGCTGTAC GGCAGTAGTC GGTGCTCGTC 6780 
CAGACGGGCC AGGGTCATGT CTTTCCACGG GCGCAGGGTC CTCGTCAGCG TAGTCTGGGT 6840 
CACGGTGAAG GGGTGCGCTC CGGGCTGCGC GCTGGCCAGG GTGCGCTTGA GGCTGGTCCT 6 900 
GCTGGTGCTG AAGCGCTGCC GGTCTTCGCC CTGCGCGTCG GCCAGGTAGC ATTTGACCAT 6 960 
GGTGTCATAG TCCAGCCCCT CCGCGGCGTG GCCCTTGGCG CGCAGCTTGC CCTTGGAGGA 7020 
GGCGCCGCAC GAGGGGCAGT GCAGACTTTT GAGGGCGTAG AGCTTGGGCG CGAGAAATAC 7080 
CGATTCCGGG GAGTAGGCAT CCGCGCCGCA GGCCCCGCAG ACGGTCTCGC ATTCCACGAG 7140 
CCAGGTGAGC TCTGGCCGTT CGGGGTCAAA AACCAGGTTT CCCCCATGCT TTTTGATGCG 7200 
TTTCTTACCT CTGGTTTCCA TGAGCCGGTG TCCACGCTCG GTGACGAAAA GGCTGTCCGT 7260 
GTCCCCGTAT ACAGACTTGA GAGGCCTGTC CTCGAGCGGT GTTCCGCGGT CCTCCTCGTA 7320 
TAGAAACTCG GACCACTCTG AGACAAAGGC TCGCGTCCAG GCCAGCACGA AGGAGGCTAA 7380 
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n r vnrzn ta nrrir* 


1 Avj\_VjO 1 1 


1 G 1 LCACTAG 




ILi 1 v_vjLjv_ A 1 


^ TV /"f ^» TV TV rr* 

uAAGGAAGGT 


vjvjvj lbl 1LL1 


G AAGG buu 0 G 


'^TAT^TV TV IV TV ^« 

TATAAAAGGG 


V—VJV_M 1 Lot 1 Vj 


1 V. 1 (aGGAGGG 


L.CAGCTGTTG 


VjAL. 1 lv.1 G\-G 


v- 1 AAGA 1 1 GT 


CAGTTTCCAA 




G CTTTG AGGG 


TGGCCGCATC 


G 1 LAAbLTTG 


GTGG CAAAC G 


ACCCGTAGAG 


v_ AGGG ill GG 


TTTTTGTCGC 


GATCGGCGCG 


1 1 v^Vjv^vjLvjuA 


ACGCACCGCC 


ATTCGGGAAA 


v_Av_ Vj v_ \_ rtA 


GCGCGGTTGT 


GCAGGGTGAC 


1 AvjvjUvjL. 1 v-tj 


1 rGGTCCAGC 


AGAGGCGGCC 


vjl v_ 1 AvjL. 1 (jL 


GTCTCGTC CG 


GGGGGTCTGC 


l_GGG 1 LGAAG 


TV /"^ rn r^TT* TV rrT ^ if 

TAGTCTATCT 


TGCATCCTTG 


vjv»L.AAk3v».vjv_.G 


CGCTCGTATG 


GGTTGAGTGG 


<jvjAIjGGG 1 AL. 


ATGCCGCAAA 


TGTCGTAAAC 


i Vj 1 AGGG 1 AG 


CATCTTCCAC 


CGCGGATGCT 


GGGAGCGAGG 


AGGTCGGGAC 


CGAGGTTGCT 


LTGCCToAAG 


ATGGCATGTG 


AGTTGGATGA 


GGCGTCTGTG 


AGACCTACCG 


CGTCACGCAC 


GACCAGCTCG 


GCGGTGACCT 


GCACGTCTAG 


/*^*TT^ TV fT^ TV ^lmfriii 

GTCATACTTA 


TCCTGTCCCT 


TTTTTTTCCA 


GTCTTTCCAG 


TACTCTTGGA 


TCGGAAACCC 


•TTV ^ TV TV / *'TV*^"« 

gtagaactgg 


TTGACGGCCT 


GGTAGGCGCA 


CTGCGCvjGCC 


TTCCGGAGCG 


AGGTGTGGGT 


GAGGTACTGG 


TATTTGAAGT 


CAGTGTCGTC 


t-Lj 1 (jGGv_ 1 TT 


TTGGAACGCG 


GATTTGGCAG 


1 LLLbLGCGA 


GGCATAAAGT 


TGCGTGTGAT 


VjiI 1AAI 1AGG 


TGGGCGGCGA 


GCACGATCTC 


G 1 AAAG 1 1LC 


TV TV TV TV /^l /"^ 

AAGAAGCGCG 


GGATGCCCTT 


00 1 GAGL 1 GT 


TCAGGGGAGC 


TGAGCCCGTG 


g r tggaagcg 


Th ^^^^ f4 fk him m ^« 

ACGAATGAGC 


wmmt mm* mm m, mm m* mm — - - - - 

TCCACAGGTC 


AAAGGTCCTA 


AACTGGCGAC 


CTATGGCCAT 


CGGGTCTTGT 


TCCCAGCGGT 


CCCATCCAAG 


TAGAGGCTCA 


TCTCCGCCGA 


ACTTCATGAC 


ggcccccatc 


CAAGTATAGG 


TCTCTACATC 
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GGGGT CC AC T 


a mm \ -*mm - mm \ f^Hk mm mm mm mm mm mm 

CGCTCCAGGG 


TGTGAAGACA 


7440 


GATTGGTTTG 


m Tk mmm MmjM mmm mm mm mm 

TAGGTGTAGG 


CCACGTGACC 


7500 


GGTGGGGGCG 


CGTTCGTCCT 


CACTCTCTTC 


7560 


GGGTGAGTAC 


m 9 ** m*** m**\ *m* mm p_m a _ 

TCCCTCTGAA 


AAGCGGGCAT 


7620 


AAACGAGGAG 


GATTTGATAT 


TCACCTGGCC 


7680 


CATCTGGTCA 


GAAAAGACAA 


TCTTTTTGTT 


7740 


mm mm mm mm **m***kmm mm 

GGCGTTGGAC 


AGCAACTTGG 


CGATGGAGCG 


7800 


CTCCTTGGCC 


GCGATGTTTA 


GCTGCACGTA 


7860 


GACGGTGGTG 


CGCTCGTCGG 


GCACCAGGTG 


7920 


AAGGTCAACG 


CTGGTGGCTA 


CCTCTCCGCG 


7980 


f + mn* ^m* mm mm 

GCCCTTGCGC 


GAGCAGAATG 


GCGGTAGGGG 


8040 


*^*i ^*l **m mm mm* 

GTCCACGGTA 


AAGACCCCGG 


GCAGCAGGCG 


8100 


m^m m, m^ M**mm* mmm^* mx Mmm mm 

CAAGTCTAGC 


GCCTGCTGCC 


ATGCGCGGGC 


8160 


GGGACCCCAT 


GGCATGGGGT 


GGGTG AG CGC 


8220 


mm mm* m* mm mv 1— 1 -mm mm mm* mm 

GTAGAGGGGC 


TCTCTGAGTA 


TTCCAAGATA 


8280 


mm\ mm mm mm mm mm mm m. mm mmmt 

GGCGCGCACG 


TAATCGTATA 


GTTCGTGCGA 


8340 


ACGGGCGGGC 


TGCTCTGCTC 


GGAAGACTAT 


8400 


TATGGTTGGA 


CGCTGGAAGA 


CGTTGAAGCT 


8460 


GAAGGAGGCG 


TAGGAGTCGC 


GCAGCTTGTT 


8520 


GGCGCAGTAG 


TCCAGGGTTT 


CCTTGATGAT 


8580 


CAGCTCGCGG 


TTGAGGACAA 


ACTCTTCGCG 


8640 


GTCGGCCTCC 


GAACGGTAAG 


AGCCTAGCAT 


8700 


GCATCCCTTT 


TCTACGGGTA 


GCGCGTATGC 


8760 


GAGCGCAAAG 


GTGTCCCTGA 


CCATGACTTT 


8820 


GCATCCGCCC 


TGCTCCCAGA 


GCAAAAAGTC 


8880 


mm mm mm mm m. * _-_ _-**. — - — 

GGCGAAGGTG 


ACATCGTTGA 


AGAGTATCTT 


8940 


GCGGAAGGGT 


CCCGGCACCT 


CGGAACGGTT 


9000 


^^Vrn Tl «, v, >ta mm% mmm mmm 

GTCAAAGCCG 


TTGATGTTGT 


GGCCCACAAT 


9060 


|M *** mm mm m. mmm mm mmm 

GATGGAAGGC 


AATTTTTTAA 


GTTCCTCGTA 


9120 


CTCTGAAAGG 


GCCCAGTCTG 


CAAGATGAGG 


9180 


ACGGGCCATT 


AGCATTTGCA 


GGTGGTCGCG 


9240 


TTTTTCTGGG 


GTGATGCAGT 


AGAAGGTAAG 


9300 


GTTCGCGGCT 


AGGTCTCGCG 


CGGCAGTCAC 


9360 


CAGCATGAAG 


GGCACGAGCT 


GCTTCCCAAA 


9420 


GTAGGTGACA 


AAGAGACGCT 


CGGTGCGAGG 


9460 
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ATGCGAGCCG ATCGGGAAGA ACTGGATCTC CCGCCACCAA TTGGAGGAGT GGCTATTGAT 9540 

GTGGTGAAAG TAGAAGTCCC TGCGACGGGC CGAACACTCG TGCTGGCTTT TGTAAAAACG 96 00 

TGCGCAGTAC TGGCAGCGGT GCACGGGCTG TACATCCTGC ACGAGGTTGA CCTGACGACC 9660 

GCGCACAAGG AAGCAGAGTG GGAATTTGAG CCCCTCGCCT GGCGGGTTTG GCTGGTGGTC 9720 
TTCTACTTCG GCTGCTTGTC CTTGACCGTC TGGCTGCTCG AGGGGAGTTA CGGTGGATCG 9780 
GACCACCACG CCGCGCGAGC CCAAAGTCCA GATGTCCGCG CGCGGCGGTC GGAGCTTGAT 9840 
GACAACATCG CGCAGATGGG AGCTGTCCAT GGTCTGGAGC TCCCGCGGCG TCAGGTCAGG 9900 
CGGGAGCTCC TGCAGGTTTA CCTCGCATAG ACGGGTCAGG GCGCGGGCTA GATCCAGGTG 9960 

ATACGTAATT TCCAGGGGCT GGTTGGTGGC GGCGTCGATG GCTTGCAAGA GGCCGCATCC 10020 

CCGCGGCGCG ACTACGGTAC CGCGCGGCGG GCGGTGGGCC GCGGGGGTGT CCTTGGATGA 10080 

TGCATCTAAA AGCGGTGACG CGGGCGAGCC CCCGGAGGTA GGGGGGGCTC CGGACCCGCC 10140 

GGGAGAGGGG GCAGGGGCAC GTCGGCGCCG CGCGCGGGCA GGAGCTGGTG CTGCGCGCGT 10200 

AGGTTGCTGG CGAACGCGAC GACGCGGCGG TTGATCTCCT GAATCTGGCG CCTCTGCGTG 10260 

AAGACGACGG GCCCGGTGAG CTTGAGCCTG AAAGAGAGTT CGACAGAATC AATTTCGGTG 10320 

TCGTTGACGG CGGCCTGGCG CAAAATCTCC TGCACGTCTC CTGAGTTGTC TTGATAGGCG 10380 

ATCTCGGCCA TGAACTGCTC GATCTCTTCC TCCTGGAGAT CTCCGCGTCC GGCTCGCTCC 10440 

ACGGTGGCGG CGAGGTCGTT GGAAATGCGG GCCATGAGCT GCGAGAAGGC GTTGAGGCCT 10500 

CCCTCGTTCC AGACGCGGCT GTAGACCACG CCCCCTTCGG CATCGCGGGC GCGCATGACC 10560 

ACCTGCGCGA GATTGAGCTC CACGTGCCGG GCGAAGACGG CGTAGTTTCG CAGGCGCTGA 10620 

AAGAGGTAGT TGAGGGTGGT GGCGGTGTGT TCTGCCACGA AGAAGTACAT AACCCAGCGT 10680 

CGCAACGTGG ATTCGTTGAT ATCCCCCAAG GCCTCAAGGC GCTCCATGGC CTCGTAGAAG 10740 

TCCACGGCGA AGTTGAAAAA CTGGGAGTTG CGCGCCGACA CGGTTAACTC CTCCTCCAGA 10800 

AGACGGATGA GCTCGGCGAC AGTGTCGCGC ACCTCGCGCT CAAAGGCTAC AGGGGCCTCT 10860 
TCTTCTTCTT CAATCTCCTC TTCCATAAGG GCCTCCCCTT CTTCTTCTTC TGGCGGCGGT 10920 
GGGGGAGGGG GGACACGGCG GCGACGACGG CGCACCGGGA GGCGGTCGAC AAAGCGCTCG 10980 
ATCATCTCCC CGCGGCGACG GCGCATGGTC TCGGTGACGG CGCGGCCGTT CTCGCGGGGG 11040 
CGCAGTTGGA AGACGCCGCC CGTCATGTCC CGGTTATGGG TTGGCGGGGG GCTGCCATGC 11100 
GGCAGGGATA CGGCGCTAAC GATGCATCTC AACAATTGTT GTGTAGGTAC TCCGCCGCCG 11160 
AGGGACCTGA GCGAGTCCGC ATCGACCGGA TCGGAAAACC TCTCGAGAAA GGCGTCTAAC 11220 
CAGTCACAGT CGCAAGGTAG GCTGAGCACC GTGGCGGGCG GCAGCGGGCG GCGGTCGGGG 112 80 
TTGTTTCTGG CGGAGGTGCT GCTGATGATG TAATTAAAGT AGGCGGTCTT GAGACGGCGG 11340 
ATGGTCGACA GAAGCACCAT GTCCTTGGGT CCGGCCTGCT GAATGCGCAG GCGGTCGGCC 11400 
ATGCCCCAGG CTTCGTTTTG ACATCGGCGC AGGTCTTTGT AGTAGTCTTG CATGAGCCTT 11460 
TCTACCGGCA CTTCTTCTTC TCCTTCCTCT TGTCCTGCAT CTCTTGCATC TATCGCTGCG 11520 
GCGGCGGCGG AGTTTGGCCG TAGGTGGCGC CCTCTTCCTC CCATGCGTGT GACCCCGAAG 11580 
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GCTGAAGCAG GGCTAGGTCG 


GCGACAACGC 


GCTCGGCTAA 


T 21 T rz a c c Trr 


11640 




TGAGGGTAGA 


CTGGAAGTCA 


TCCATGTCCA 


CAAAGCGGTG 


w 1M 1 Vaw*Jww w 


1 1 inn 

11700 


G 1 G 1 1 GATGG 


TGTAAGTGCA 


GTTGGCCATA 


ACGGACCAGT 


TAACGGTCTG 




11760 




CGGTGTACCT 


GAGACGCGAG 


TAAGCCCTCG 


AGTCAAATAC 




1182 0 


L AAG 1 LLGLA 


CCAGGTACTG 


GTATCCCACC 


aaaaagtgcg 


GCGGCGGCTG 


w- wvjVj 1 AGAGo 


11880 


GGLLAGLGTA 


GGGTGGCCGG 


GGCTCCGGGG 


GCGAGATCTT 


CCAACATAAG 


OwoA 1 vjA I A 1 


11940 


lcgtagatgt 


ACCTGGACAT 


CCAGGTGATG 


CCGGCGGCGG 


TGGTGGAGGC 




12000 


1 LGLGGACGC 


GGTTCCAGAT 


GTTGCGCAGC 


GGCAAAAAGT 


GCTCCATGGT 


v-vju-w-AwGC I L 


12060 


1 GGCCGGTCA 


GGCGCGCGCA ATCGTTGACG 


CTCTACCGTG 


CAAAAGGAGA 


oww 1 1 AAGw 


12120 


vjVjVjL AL 1L1 1 


CCGTGGTCTG 


GTGGATAAAT 


TCGCAAGGGT 


ATCATGGCGG 




12180 




GTATCCGGCC 


GTCCGCCGTG 

— — - -taw — ip w^ ~ v ^tai» NJ A VJ 


ATCCATGCGG 


TTACCGCCCG 


wV3 i VJ 1 wGAAw 


12240 




GACGTCAGAC 


AACGGGGGAG 


TGCTCCTTTT 


GGCTTCCTTC 




12300 


ubt AGLaGCG 


CTAGCTTTTT 


TGGCCACTGG 


CCGCGCGCAG 


CGTAAGCGGT 


l Aw-vj w I GGAA 


12360 


AGL GAAAGCA 


TTAAGTGGCT 


CGCTCCCTRT 


AGCCGGAGGG 


TTATTTTCCA 


AGGGTTG AG T 


12420 


CGCGGGACCC 


CCGGTTCGAG 


TCTCGGACCG 


GCCGGACTGC 


GGCGAACGGG 


GGI ITGCCTC 


12480 


LLCGTCATGC 


AAGACCCCGC 


TTGCAAATTC 


CTCCGGAAAC 


AGGGACGAGC 


PPPTTTWUmo 

ILL TTTTTTG 


12540 


CTTTTCwCAG 


ATGCATCCGG 


TGCTGCGGCA 


GATGCGCCCC 


CCTCCTCAGC 


AGCGGCAAGA 


12600 


G C AAG AGC AG 


CGGCAGACAT 


GCAGGGCACC 


CTCCCCTCCT 


CCTACCGCGT 


CAGGAGGGGC 


12660 


GACATCCGCG 


GTTGACGCGG 


CAGCAGATGG 


TGATTACGAA 


CCCCCGCGGC 


GCCGGGCCCG 


12720 


GCACTACCTG 


GACTTGGAGG 


AGGGCGAGGG 


CCTGGCGCGG 


CTAGGAGCGC 


CCTCTCCTGA 


12780 


GCGGTACCCA 


AGGGTGCAGC 


TGAAGCGTGA 


TACGCGTGAG 


GCGTACGTGC* CGCGGCAGAA 


12840 


CCTGTTTCGC 


GACCGCGAGG 


GAGAGGAGCC 


CGAGGAGATG 


CGGGATCGAA AGTTCCACGC 


1 ^ A A A 

12900 


AGGGCGCGAG 


CTGCGGCATG 


GCCTGAATCG 


CGAGCGGTTG 


CTGCGCGAGG 


AGGACTTTGA 


12960 


GCCCGACGCG 


CGAACCGGGA 


TTAGTCCCGC 


GCGCGCACAC 


GTGGCGGCCG 


CCGACCTGGT 


13020 


AACCGCATAC 


GAGCAGACGG 


TGAACCAGGA 


GATTAACTTT 


CAAAAAAGCT 


TTAACAACCA 


13080 


LG1GCGTACG 


CTTGTGGCGC 


GCGAGGAGGT 


GGCTATAGGA 


CTGATGCATC 


TGTGGGACTT 


13140 


TGTAAGCG CG 


CTGGAGCAAA ACCCAAATAG 


CAAGCCGCTC 


ATGGCGCAGC 


TGTTCCTTAT 


13200 


AG TG C AGC AC 


AGCAGGGACA 


ACGAGGCATT 


CAGGGATGCG 


CTGCTAAACA 


TAGTAGAGCC 


13260 


LGAGGGCCGC 


TGGCTGCTCG 


ATTTGATAAA 


CATCCTGCAG 


AGCATAGTGG 


TGCAGGAGCG 


13320 


CAGCTTGAGC 


CTGGCTGACA 


AGGTGGCCGC 


CATCAACTAT 


TCCATGCTTA 


GCCTGGGCAA 


13380 


GTTTTACGCC 


CGCAAGATAT 


ACCATACCCC 


TTACGTTCCC 


ATAGACAAGG 


AGGTAAAGAT 


13440 


CGAGGGGTTC 


TACATGCGCA 


TGGCGCTGAA 


GGTGCTTACC 


TTGAGCGACG 


ACCTGGGCGT 


13500 


TTATCGCAAC 


GAGCGCATCC 


ACAAGGCCGT 


GAGCGTGAGC 


CGGCGGCGCG 


AGCTCAGCGA 


13560 


CCGCGAGCTG 


ATGCACAGCC 


TGCAAAGGGC 


CCTGGCTGGC 


ACGGGCAGCG 


GCGATAGAGA 


13620 


GGCCGAGTCC 


TACTTTGACG 


CGGGCGCTGA 


CCTGCGCTGG 


GCCCCAAGCC 


GACGCGCCCT 


13680 
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GGGGCCGGAC 


CTGGGCTGGC 


GGTGGCACCC 


GCGCGCGCTG 
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GCAACGTCGG 13 740 


CGGCGTGGAG 


GAATATGACG 


AGGACGATGA 


GTACGAGCCA 


GAGGACGGCG 


AGTACTAAGC 


13800 


GGTGATGTTT 


CTGATCAGAT 


GATGCAAGAC 


GCAACGGACC 


CGGCGGTGCG 


GGCGGCGCTG 


13860 


CAGAGCCAGC 


CGTCCGGCCT 


TAACTCCACG 


GACGACTGGC 


GCCAGGTCAT 


GGACCGCATC 


13920 


ATGTCGCTGA 


CTGCGCGCAA 


TCCTGACGCG 


TTCCGGCAGC 


AGCCGCAGGC 


CAACCGGCTC 


13980 


TCCGCAATTC 


TGGAAGCGGT 


GGTCCCGGCG 


CGCGCAAACC 


CCACGCACGA 


GAAGGTGCTG 


14040 


GCGATCGTAA 


ACGCGCTGGC 


CGAAAACAGG 


GCCATCCGGC 


CCGACGAGGC 


CGGCCTGGTC 


14100 


TACGACGCGC 


TGCTTCAGCG 


CGTGGCTCGT 


TACAACAGCG 


GCAACGTGCA 


GACCAACCTG 


14160 


GACCGGCTGG 


TGGGGGATGT 


GCGCGAGGCC 


GTGGCGCAGC 


GTGAGCGCGC 


GCAGCAGCAG 


14220 


GGCAACCTGG 


GCTCCATGGT 


TGCACTAAAC 


GCCTTCCTGA 


GTACACAGCC 


CGCCAACGTG 


14280 


CCGCGGGGAC 


AGGAGGACTA 


CACCAACTTT 


GTGAGCGCAC 


TGCGGCTAAT 


GGTGACTGAG 


14340 


ACACCGCAAA 


GTGAGGTGTA 


CCAGTCTGGG 


CCAGACTATT 


TTTTCCAGAC 


CAGTAGACAA 


14400 


GGCCTGCAGA 


CCGTAAACCT 


GAGCCAGGCT 


TTCAAAAACT 


TGCAGGGGCT 


GTGGGGGGTG 


14460 


CGGGCTCCCA 


CAGGCGACCG 


CGCGACCGTG 


TCTAGCTTGC 


TGACGCCCAA 


CTCGCGCCTG 


14520 


TTGCTGCTGC 


TAATAGCGCC 


CTTCACGGAC 


AGTGGCAGCG 


TGTCCCGGGA 


CACATACCTA 


14580 


GGTCACTTGC 


TGACACTGTA 


CCGCGAGGCC 


ATAGGTCAGG 


CGCATGTGGA 


CGAGCATACT 


14640 


TTCCAGGAGA 


TTACAAGTGT 


CAGCCGCGCG 


CTGGGGCAGG 


AGGACACGGG 


CAGCCTGGAG 


14700 


GCAACCCTAA 


ACTACCTGCT 


GACCAACCGG 


CGGCAGAAGA 


TCCCCTCGTT 


GCACAGTTTA 


14760 


AACAGCGAGG 


AGGAGCGCAT 


TTTGCGCTAC 


GTGCAGCAGA 


GCGTGAGCCT 


TAACCTGATG 


14820 


CGCGACGGGG 


TAACGCCCAG 


CGTGGCGCTG 


GACATGACCG 


CGCGCAACAT 


GGAACCGGGC 


14880 


ATGTATGCCT 


1 CAAACCGGCC 


: GTTTATCAAC 


! CGCCTAATGG 


ACTACTTGCA 


TCGCGCGGCC 


14940 



GCCGTGAACC CCGAGTATTT CACCAATGCC ATCTTGAACC CGCACTGGCT ACCGCCCCCT 15000 

GGTTTCTACA CCGGGGGATT CGAGGTGCCC GAGGGTAACG ATGGATTCCT CTGGGACGAC 15060 

ATAGACGACA GCGTGTTTTC CCCGCAACCG CAGACCCTGC TAGAGTTGCA ACAGCGCGAG 15120 

CAGGCAGAGG CGGCGCTGCG AAAGGAAAGC TTCCGCAGGC CAAGCAGCTT GTCCGATCTA 15180 

GGCGCTGCGG CCCCGCGGTC AGATGCTAGT AGCCCATTTC CAAGCTTGAT AGGGTCTCTT 15240 

ACCAGCACTC GCACCACCCG CCCGCGCCTG CTGGGCGAGG AGGAGTACCT AAACAACTCG 15300 

CTGCTGCAGC CGCAGCGCGA AAAAAACCTG CCTCCGGCAT TTCCCAACAA CGGGATAGAG 15360 

AGCCTAGTGG ACAAGATGAG TAGATGGAAG ACGTACGCGC AGGAGCACAG GGACGTGCCA 15420 

GGCCCGCGCC CGCCCACCCG TCGTCAAAGG CACGACCGTC AGCGGGGTCT GGTGTGGGAG 15480 

GACGATGACT CGGCAGACGA CAGCAGCGTC CTGGATTTGG GAGGGAGTGG CAACCCGTTT 15540 

GCGCACCTTC GCCCCAGGCT GGGGAGAATG TTTTAAAAAA AAAAAAGCAT GATGCAAAAT 15600 

AAAAAACTCA CCAAGGCCAT GGCACCGAGC GTTGGTTTTC TTGTATTCCC CTTAGTATGC 15660 

GGCGCGCGGC GATGTATGAG GAAGGTCCTC CTCCCTCCTA CGAGAGTGTG GTGAGCGCGG 15720 

CGCCAGTGGC GGCGGCGCTG GGTTCTCCCT TCGATGCTCC CCTGGACCCG CCGTTTGTGC 15780 
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GAGTTGGCAC 15840 


CCCTATTCGA 


CACCACCCGT 


<TflR JTi ^VT Tl 

GTGTACCTGG 


TGGACAACAA 


GTCAACGGAT 


GTGGCATCCC 


15900 


TGAACTACCA 


GAACGACCAC 


4fc XT XT 4 T 

AGCAACTTTC 


TGACCACGGT 


CATTCAAAAC 


AATGACTACA 


15960 

mm 7 m* T^ %X 


GCCCGGGGGA 


GGCAAGCACA 


XT XT Tk XT *m m\ _ 

CAGACCATCA 


ATCTTGACGA 


CCGGTCGCAC 


TGGGGCGGCG 


16020 


AC CTG AAAAC 


CATCCTGCAT 


ACCAACATGC 


CAAATGTGAA 


CGAGTTCATG 


TTTACCAATA 


16080 


AG TTT AAGG C 


GCGGGTGATG 


GTGTCGCGCT 


TGCCTACTAA 


GGACAATCAG 


GTGGAGCTGA 


16140 


TV TV TV /T TV /tj rr>.^T 

AATACGAGTG 


GGTGGAGTTC 


ACGCTGCCCG 


AGGGCAACTA 


CTCCGAGACC 


ATGACCATAG 


16200 


TV ^^^^^FT^T^TA ^^^^ TV 7V 

AC CTT ATGAA 


CAACGCGATC 


XT P4T XT XT m XT XT ^ 

GTGGAGCACT 


ACTTGAAAGT 


GGGCAGACAG 


AACGGGGTTC 


16260 


1 G G AAAG CG A 


CATCGGGGTA 


T Tk XIMWTPMX4 v mm & 

AAGTTTGACA 


CCCGCAACTT 


CAGACTGGGG 


TTTGACCCCG 


16320 


I LAC I GGTCT 


TGTCATGCCT 


XT XT XT XT Ma « «a o_ _ 

GGGGTATATA 


CAAACGAAGC 


CTTCCATCCA 


GACATCATTT 


16380 

^x «x ^x \f 


TGCTGCCAGG 


ATGCGGGGTG 


GACTTCACCC 


ACAGCCGCCT 


GAGCAACTTG 


TTGGGCATCC 


16440 


GCAAGCGGCA 


ACCCTTCCAG 


GAGGGCTTTA 


GGATCACCTA 


CGATGATCTG 


GAGGGTGGTA 


16500 

m. vx ^x %x v 


TV /"i Tv mm /r /t /tat /tj 

ACATTCCCGC 


ACTGTTGGAT 


GTGGACGCCT 


ACCAGGCGAG 


CTTGAAAGAT 


GACACCGAAC 


16560 

^ w W W/ 


AGGGCGGGGG 


TGGCGCAGGC 


GGCAGCAACA 


GCAGTGGCAG 


CGGCGCGGAA 


GAGAACTCCA 


16620 

VX \J b W 


TV ^^^1 ^1 XT /T xt TA ^« *— ■ 

ACGCGGCAGC 


CGCGGCAATG 


CAGCCGGTGG 


AGGACATGAA 


CGATCATGCC 


ATTCGCGGCG 


x u o a w 


TV /T TV /T »T^»T7 fT> ^T 

ACACCTTTGC 


CACACGGGCT 


GAGGAGAAGC 


GCGCTGAGGC 


CGAAGCAGCG 


GCCGAAGCTG 


1 fi74.fl 


CCGCCCCCGC 


TGCGCAACCC 


GAGGTCGAGA 


AGCCTCAGAA 


GAAACCGGTG 


ATC AAAC C C C 


lOOUU 


TGACAGAGGA 


CAGCAAGAAA 


CGCAGTTACA 


ACCTAATAAG 


CAATGACAGC 
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CTATGACTCA GGGTCGCAGG GGCAACGTGT ATTGGGTGCG CGACTCGGTT AGCGGCCTGC 17940 

GCGTGCCCGT GCGCACCCGC CCCCCGCGCA ACTAGATTGC AAGAAAAAAC TACTTAGACT 18 000 

CGTACTGTTG TATGTATCCA GCGGCGGCGG CGCGCAACGA AGCTATGTCC AAGCGCAAAA 18060 

TCAAAGAAGA GATGCTCCAG GTCATCGCGC CGGAGATCTA TGGCCCCCCG AAGAAGGAAG 18120 



AGCAGGATTA CAAGCCCCGA AAGCTAAAGC GGGTCAAAAA GAAAAAGAAA GATGATGATG 
ATGAACTTGA CGACGAGGTG GAACTGCTGC ACGCTACCGC GCCCAGGCGA CGGGTACAGT 
GGAAAGGTCG ACGCGTAAAA CGTGTTTTGC GACCCGGCAC CACCGTAGTC TTTACGCCCG 
GTGAGCGCTC CACCCGCACC TACAAGCGCG TGTATGATGA GGTGTACGGC GACGAGGACC 
TGCTTGAGCA GGCCAACGAG CGCCTCGGGG AGTTTGCCTA CGGAAAGCGG CATAAGGACA 
TGCTGGCGTT GCCGCTGGAC GAGGGCAACC CAACACCTAG CCTAAAGCCC GTAACACTGC 
AGCAGGTGCT GCCCGCGCTT GCACCGTCCG AAGAAAAGCG CGGCCTAAAG CGCGAGTCTG 
GTGACTTGGC ACCCACCGTG CAGCTGATGG TACCCAAGCG CCAGCGACTG GAAGATGTCT 
TGGAAAAAAT GACCGTGGAA CCTGGGCTGG AGCCCGAGGT CCGCGTGCGG CCAATCAAGC 
AGGTGGCGCC GGGACTGGGC GTGCAGACCG TGGACGTTCA GATACCCACT ACCAGTAGCA 
CCAGTATTGC CACCGCCACA GAGGGCATGG AGACACAAAC GTCCCCGGTT GCCTCAGCGG 
TGGCGGATGC CGCGGTGCAG GCGGTCGCTG CGGCCGCGTC CAAGACCTCT ACGGAGGTGC 
AAACGGACCC GTGGATGTTT CGCGTTTCAG CCCCCCGGCG CCCGCGCGGT TCGAGGAAGT 
ACGGCGCCGC CAGCGCGCTA CTGCCCGAAT ATGCCCTACA TCCTTCCATT GCGCCTACCC 
CCGGCTATCG TGGCTACACC TACCGCCCCA GAAGACGAGC AACTACCCGA CGCCGAACCA 
CCACTGGAAC CCGCCGCCGC CGTCGCCGTC GCCAGCCCGT GCTGGCCCCG ATTTCCGTGC 
GCAGGGTGGC TCGCGAAGGA GGCAGGACCC TGGTGCTGCC AACAGCGCGC TACCACCCCA 
GCATCGTTTA AAAGCCGGTC TTTGTGGTTC TTGCAGATAT GGCCCTCACC TGCCGCCTCC 
GTTTCCCGGT GCCGGGATTC CGAGGAAGAA TGCACCGTAG GAGGGGCATG GCCGGCCACG 
GCCTGACGGG CGGCATGCGT CGTGCGCACC ACCGGCGGCG GCGCGCGTCG CACCGTCGCA 
TGCGCGGCGG TATCCTGCCC CTCCTTATTC CACTGATCGC CGCGGCGATT GGCGCCGTGC 
CCGGAATTGC ATCCGTGGCC TTGCAGGCGC AGAGACACTG ATTAAAAACA AGTTGCATGT 
GGAAAAATCA AAATAAAAAG TCTGGACTCT CACGCTCGCT TGGTCCTGTA ACTATTTTGT 
AGAATGGAAG ACATCAACTT TGCGTCTCTG GCCCCGCGAC ACGGCTCGCG CCCGTTCATG 
GGAAACTGGC AAGATATCGG CACCAGCAAT ATGAGCGGTG GCGCCTTCAG CTGGGGCTCG 
CTGTGGAGCG GCATTAAAAA TTTCGGTTCC ACCGTTAAGA ACTATGGCAG CAAGGCCTGG 
AACAGCAGCA CAGGCCAGAT GCTGAGGGAT AAGTTGAAAG AGCAAAATTT CCAACAAAAG 
GTGGTAGATG GCCTGGCCTC TGGCATTAGC GGGGTGGTGG ACCTGGCCAA CCAGGCAGTG 
CAAAATAAGA TTAACAGTAA GCTTGATCCC CGCCCTCCCG TAGAGGAGCC TCCACCGGCC 
GTGGAGACAG TGTCTCCAGA GGGGCGTGGC GAAAAGCGTC CGCGCCCCGA CAGGGAAGAA 
ACTCTGGTGA CGCAAATAGA CGAGCCTCCC TCGTACGAGG AGGCACTAAA GCAAGGCCTG 
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CCTACGAGTG GAACTTCAGG AAGGATGTTA ACATGGTTCT GCAGAGCTCC CTAGGAAATG 22140 

ACCTAAGGGT TGACGGAGCC AGCATTAAGT TTGATAGCAT TTGCCTTTAC GCCACCTTCT 22200 

TCCCCATGGC CCACAACACC GCCTCCACGC TTGAGGCCAT GCTTAGAAAC GACACCAACG 22260 

ACCAGTCCTT TAACGACTAT CTCTCCGCCG CCAACATGCT CTACCCTATA CCCGCCAACG 22320 

CTACCAACGT GCCCATATCC ATCCCCTCCC GCAACTGGGC GGCTTTCCGC GGCTGGGCCT 2238 0 

TCACGCGCCT TAAGACTAAG GAAACCCCAT CACTGGGCTC GGGCTACGAC CCTTATTACA 22440 

CCTACTCTGG CTCTATACCC TACCTAGATG GAACCTTTTA CCTCAACCAC ACCTTTAAGA 22500 

AGGTGGCCAT TACCTTTGAC TCTTCTGTCA GCTGGCCTGG CAATGACCGC CTGCTTACCC 22560 

CCAACGAGTT TGAAATTAAG CGCTCAGTTG ACGGGGAGGG TTACAACGTT GCCCAGTGTA 22620 

ACATGACCAA AGACTGGTTC CTGGTACAAA TGCTAGCTAA CTACAACATT GGCTACCAGG 22680 

GCTTCTATAT CCCAGAGAGC TACAAGGACC GCATGTACTC CTTCTTTAGA AACTTCCAGC 22740 

CCATGAGCCG TCAGGTGGTG GATGATACTA AATACAAGGA CTACCAACAG GTGGGCATCC 22800 

TACACCAACA CAACAACTCT GGATTTGTTG GCTACCTTGC CCCCACCATG CGCGAAGGAC 2286 0 

AGGCCTACCC TGCTAACTTC CCCTATCCGC TTATAGGCAA GACCGCAGTT GACAGCATTA 22920 

CCCAGAAAAA GTTTCTTTGC GATCGCACCC TTTGGCGCAT CCCATTCTCC AGTAACTTTA 22980 

TGTCCATGGG CGCACTCACA GACCTGGGCC AAAACCTTCT CTACGCCAAC TCCGCCCACG 23040 

CGCTAGACAT GACTTTTGAG GTGGATCCCA TGGACGAGCC CACCCTTCTT TATGTTTTGT 23100 

TTGAAGTCTT TGACGTGGTC CGTGTGCACC GGCCGCACCG CGGCGTCATC GAAACCGTGT 23160 

ACCTGCGCAC GCCCTTCTCG GCCGGCAACG CCACAACATA AAGAAGCAAG CAACATCAAC 23220 

AACAGCTGCC GCCATGGGCT CCAGTGAGCA GGAACTGAAA GCCATTGTCA AAGATCTTGG 23280 

TTGTGGGCCA TATTTTTTGG GCACCTATGA CAAGCGCTTT CCAGGCTTTG TTTCTCCACA 23340 

CAAGCTCGCC TGCGCCATAG TCAATACGGC CGGTCGCGAG ACTGGGGGCG TACACTGGAT 23400 

GGCCTTTGCC TGGAACCCGC ACTCAAAAAC ATGCTACCTC TTTGAGCCCT TTGGCTTTTC 23460 

TGACCAGCGA CTCAAGCAGG TTTACCAGTT TGAGTACGAG TCACTCCTGC GCCGTAGCGC 23520 

CATTGCTTCT TCCCCCGACC GCTGTATAAC GCTGGAAAAG TCCACCCAAA GCGTACAGGG 23580 

GCCCAACTCG GCCGCCTGTG GACTATTCTG CTGCATGTTT CTCCACGCCT TTGCCAACTG 23640 

GCCCCAAACT CCCATGGATC ACAACCCCAC CATGAACCTT ATTACCGGGG TACCCAACTC 23700 

CATGCTCAAC AGTCCCCAGG TACAGCCCAC CCTGCGTCGC AACCAGGAAC AGCTCTACAG 23760 

CTTCCTGGAG CGCCACTCGC CCTACTTCCG CAGCCACAGT GCGCAGATTA GGAGCGCCAC 23820 

TTCTTTTTGT CACTTGAAAA ACATGTAAAA ATAATGTACT AGAGACACTT TCAATAAAGG 23880 

CAAATGCTTT TATTTGTACA CTCTCGGGTG ATTATTTACC CCCACCCTTG CCGTCTGCGC 23940 

CGTTTAAAAA TCAAAGGGGT TCTGCCGCGC ATCGCTATGC GCCACTGGCA GGGACACGTT 24000 

GCGATACTGG TGTTTAGTGC TCCACTTAAA CTCAGGCACA ACCATCCGCG GCAGCTCGGT 24 060 
GAAGTTTTCA CTCCACAGGC TGCGCACCAT CACCAACGCG TTTAGCAGGT CGGGCGCCGA 24120 
TATCTTGAAG TCGCAGTTGG GGCCTCCGCC CTGCGCGCGC GAGTTGCGAT ACACAGGGTT 24180 
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TGGAGTGTTG GTGGAACTCG AGGGTGACAA CGCGCGCCTA GCCGTACTAA AACGCAGCAT 26340 

CGAGGTCACC CACTTTGCCT ACCCGGCACT TAACCTACCC CCCAAGGTCA TGAGCACAGT 26400 

CATGAGTGAG CTGATCGTGC GCCGTGCGCA GCCCCTGGAG AGGGATGCAA ATTTGCAAGA 2646 0 

ACAAACAGAG GAGGGCCTAC CCGCAGTTGG CGACGAGCAG CTAGCGCGCT GGCTTCAAAC 26520 



GCGCGAGCCT GCCGACTTGG AGGAGCGACG CAAACTAATG ATGGCCGCAG TGCTCGTTAC 
CGTGGAGCTT GAGTGCATGC AGCGGTTCTT TGCTGACCCG GAGATGCAGC GCAAGCTAGA 
GGAAACATTG CACTACACCT TTCGACAGGG CTACGTACGC CAGGCCTGCA AGATCTCCAA 
CGTGGAGCTC TGCAACCTGG TCTCCTACCT TGGAATTTTG CACGAAAACC GCCTTGGGCA 
AAACGTGCTT CATTCCACGC TCAAGGGCGA GGCGCGCCGC GACTACGTCC GCGACTGCGT 
TTACTTATTT CTATGCTACA CCTGGCAGAC GGCCATGGGC GTTTGGCAGC AGTGCTTGGA 
GGAGTGCAAC CTCAAGGAGC TGCAGAAACT GCTAAAGCAA AACTTGAAGG ACCTATGGAC 
GGCCTTCAAC GAGCGCTCCG TGGCCGCGCA CCTGGCGGAC ATCATTTTCC CCGAACGCCT 
GCTTAAAACC CTGCAACAGG GTCTGCCAGA CTTCACCAGT CAAAGCATGT TGCAGAACTT 
TAGGAACTTT ATCCTAGAGC GCTCAGGAAT CTTGCCCGCC ACCTGCTGTG CACTTCCTAG 
CGACTTTGTG CCCATTAAGT ACCGCGAATG CCCTCCGCCG CTTTGGGGCC ACTGCTACCT 
TCTGCAGCTA GCCAACTACC TTGCCTACCA CTCTGACATA ATGGAAGACG TGAGCGGTGA 
CGGTCTACTG GAGTGTCACT GTCGCTGCAA CCTATGCACC CCGCACCGCT CCCTGGTTTG 
CAATTCGCAG CTGCTTAACG AAAGTCAAAT TATCGGTACC TTTGAGCTGC AGGGTCCCTC 
GCCTGACGAA AAGTCCGCGG CTCCGGGGTT GAAACTCACT CCGGGGCTGT GGACGTCGGC 
TTACCTTCGC AAATTTGTAC CTGAGGACTA CCACGCCCAC GAGATTAGGT TCTACGAAGA 
CCAATCCCGC CCGCCAAATG CGGAGCTTAC CGCCTGCGTC ATTACCCAGG GCCACATTCT 
TGGCCAATTG CAAGCCATCA ACAAAGCCCG CCAAGAGTTT CTGCTACGAA AGGGACGGGG 
GGTTTACTTG GACCCCCAGT CCGGCGAGGA GCTCAACCCA ATCCCCCCGC CGCCGCAGCC 
CTATCAGCAG CAGCCGCGGG CCCTTGCTTC CCAGGATGGC ACCCAAAAAG AAGCTGCAGC 
TGCCGCCGCC ACCCACGGAC GAGGAGGAAT ACTGGGACAG TCAGGCAGAG GAGGTTTTGG 
ACGAGGAGGA GGAGGACATG ATGGAAGACT GGGAGAGCCT AGACGAGGAA GCTTCCGAGG 
TCGAAGAGGT GTCAGACGAA ACACCGTCAC CCTCGGTCGC ATTCCCCTCG CCGGCGCCCC 
AGAAATCGGC AACCGGTTCC AGCATGGCTA CAACCTCCGC TCCTCAGGCG CCGCCGGCAC 
TGCCCGTTCG CCGACCCAAC CGTAGATGGG ACACCACTGG AACCAGGGCC GGTAAGTCCA 
AGCAGCCGCC GCCGTTAGCC CAAGAGCAAC AACAGCGCCA AGGCTACCGC TCATGGCGCG 
GGCACAAGAA CGCCATAGTT GCTTGCTTGC AAGACTGTGG GGGCAACATC TCCTTCGCCC 
GCCGCTTTCT TCTCTACCAT CACGGCGTGG CCTTCCCCCG TAACATCCTG CATTACTACC 
GTCATCTCTA CAGCCCATAC TGCACCGGCG GCAGCGGCAG CGGCAGCAAC AGCAGCGGCC 
ACACAGAAGC AAAGGCGACC GGATAGCAAG ACTCTGACAA AGCCCAAGAA ATCCACAGCG 
GCGGCAGCAG CAGGAGGAGG AGCGCTGCGT CTGGCGCCCA ACGAACCCGT ATCGACCCGC 



26580 

26640 

26700 

26760 

26820 

26880 

26940 

27000 

27060 

27120 

27180 

27240 

27300 

27360 

27420 

27480 

27540 

27600 

27660 

27720 

27780 

27840 

27900 

27960 

28020 

28080 

28140 

28200 

28260 

28320 

28380 
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GAGCTTAGAA ACAGGATTTT TCCCACTCTG TATGCTATAT TTCAACAGAG CAGGGGCCAA 28440 
GAACAAGAGC TGAAAATAAA AAACAGGTCT CTGCGATCCC TCACCCGCAG CTGCCTGTAT 28500 
CACAAAAGCG AAGATCAGCT TCGGCGCACG CTGGAAGACG CGGAGGCTCT CTTCAGTAAA 28560 
TACTGCGCGC TGACTCTTAA GGACTAGTTT CGCGCCCTTT CTCAAATTTA AGCGCGAAAA 2862 0 



CTACGTCATC TCCAGCGGCC ACACCCGGCG CCAGCACCTG TCGTCAGCGC CATTATGAGC 
AAGGAAATTC CCACGCCCTA CATGTGGAGT TACCAGCCAC AAATGGGACT TGCGGCTGGA 
GCTGCCCAAG ACTACTCAAC CCGAATAAAC TACATGAGCG CGGGACCCCA CATGATATCC 
CGGGTCAACG GAATCCGCGC CCACCGAAAC CGAATTCTCT TGGAACAGGC GGCTATTACC 
ACCACACCTC GTAATAACCT TAATCCCCGT AGTTGGCCCG CTGCCCTGGT GTACCAGGAA 
AGTCCCGCTC CCACCACTGT GGTACTTCCC AGAGACGCCC AGGCCGAAGT TCAGATGACT 
AACTCAGGGG CGCAGCTTGC GGGCGGCTTT CGTCACAGGG TGCGGTCGCC CGGGCAGGGT 
ATAACTCACC TGACAATCAG AGGGCGAGGT ATTCAGCTCA ACGACGAGTC GGTGAGCTCC 
TCGCTTGGTC TCCGTCCGGA CGGGACATTT CAGATCGGCG GCGCCGGCCG TCCTTCATTC 
ACGCCTCGTC AGGCAATCCT AACTCTGCAG ACCTCGTCCT CTGAGCCGCG CTCTGGAGGC 
ATTGGAACTC TGCAATTTAT TGAGGAGTTT GTGCCATCGG TCTACTTTAA CCCCTTCTCG 
GGACCTCCCG GCCACTATCC GGATCAATTT ATTCCTAACT TTGACGCGGT AAAGGACTCG 
GCGGACGGCT ACGACTGAAT GTTAATTAAG TTCCTGTCCA TCCGCACCCA CTATCTTCAT 
GTTGTTGCAG ATGAAGCGCG CAAGACCGTC TGAAGATACC TTCAACCCCG TGTATCCATA 
TGACACGGAA ACCGGTCCTC CAACTGTGCC TTTTCTTACT CCTCCCTTTG TATCCCCCAA 
TGGGTTTCAA GAGAGTCCCC CTGGGGTACT CTCTTTGCGC CTATCCGAAC CTCTAGTTAC 
CTCCAATGGC ATGCTTGCGC TCAAAATGGG CAACGGCCTC TCTCTGGACG AGGCCGGCAA 
CCTTACCTCC CAAAATGTAA CCACTGTGAG CCCACCTCTC AAAAAAACCA AGTCAAACAT 
AAACCTGGAA ATATCTGCAC CCCTCACAGT TACCTCAGAA GCCCTAACTG TGGCTGCCGC 
-CGCACCTCTA ATGGTCGCGG GCAACACACT CACCATGCAA TCACAGGCCC CGCTAACCGT 
GCACGACTCC AAACTTAGCA TTGCCACCCA AGGACCCCTC ACAGTGTCAG AAGGAAAGCT 29880 
AGCCCTGCAA ACATCAGGCC CCCTCACCAC CACCGATAGC AGTACCCTTA CTATCACTGC 29940 
CTCACCCCCT CTAACTACTG CCACTGGTAG CTTGGGCATT GACTTGAAAG AGCCCATTTA 
TACACAAAAT GGAAAACTAG GACTAAAGTA CGGGGCTCCT TTGCATGTAA CAGACGACCT 
AAACACTTTG ACCGTAGCAA CTGGTCCAGG TGTGACTATT AATAATACTT CCTTGCAAAC 
TAAAGTTACT GGAGCCTTGG GTTTTGATTC ACAAGGCAAT ATGCAACTTA ATGTAGCAGG 
AGGACTAAGG ATTGATTCTC AAAACAGACG CCTTATACTT GATGTTAGTT ATCCGTTTGA 
TGCTCAAAAC CAACTAAATC TAAGACTAGG ACAGGGCCCT CTTTTTATAA ACTCAGCCCA 
CAACTTGGAT ATTAACTACA ACAAAGGCCT TTACTTGTTT ACAGCTTCAA ACAATTCCAA 
AAAGCTTGAG GTTAACCTAA GCACTGCCAA GGGGTTGATG TTTGACGCTA CAGCCATAGC 
CATTAATGCA GGAGATGGGC TTGAATTTGG TTCACCTAAT GCACCAAACA CAAATCCCCT 
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CAAAACAAAA ATTGGCCATG GCCTAGAATT TGATTCAAAC AAGGCTATGG TTCCTAAACT 30540 
AGGAACTGGC CTTAGTTTTG ACAGCACAGG TGCCATTACA GTAGGAAACA AAAATAATGA 30600 
TAAGCTAACT TTGTGGACCA CACCAGCTCC ATCTCCTAAC TGTAGACTAA ATGCAGAGAA 30660 
AGATGCTAAA CTCACTTTGG TCTTAACAAA ATGTGGCAGT CAAATACTTG CTACAGTTTC 30720 
AGTTTTGGCT GTTAAAGGCA GTTTGGCTCC AATATCTGGA ACAGTTCAAA GTGCTCATCT 
TATTATAAGA TTTGACGAAA ATGGAGTGCT ACTAAACAAT TCCTTCCTGG ACCCAGAATA 
TTGGAACTTT AGAAATGGAG ATCTTACTGA AGGCACAGCC TATACAAACG CTGTTGGATT 
TATGCCTAAC CTATCAGCTT ATCCAAAATC TCACGGTAAA ACTGCCAAAA GTAACATTGT 
CAGTCAAGTT TACTTAAACG GAGACAAAAC TAAACCTGTA ACACTAACCA TTACACTAAA 31020 
CGGTACACAG GAAACAGGAG ACACAACTCC AAGTGCATAC TCTATGTCAT TTTCATGGGA 31080 
CTGGTCTGGC CACAACTACA TTAATGAAAT ATTTGCCACA TCCTCTTACA CTTTTTCATA 
CATTGCCCAA GAATAAAGAA TCGTTTGTGT TATGTTTCAA CGTGTTTATT TTTCAATTGC 
AGAAAATTTC AAGTCATTTT TCATTCAGTA GTATAGCCCC ACCACCACAT AGCTTATACA 
GATCACCGTA CCTTAATCAA ACTCACAGAA CCCTAGTATT CAACCTGCCA CCTCCCTCCC 
AACACACAGA GTACACAGTC CTTTCTCCCC GGCTGGCCTT AAAAAGCATC ATATCATGGG 
TAACAGACAT ATTCTTAGGT GTTATATTCC ACACGGTTTC CTGTCGAGCC AAACGCTCAT 
CAGTGATATT AATAAACTCC CCGGGCAGCT CACTTAAGTT CATGTCGCTG TCCAGCTGCT 
GAGCCACAGG CTGCTGTCCA ACTTGCGGTT GCTTAACGGG CGGCGAAGGA GAAGTCCACG 
CCTACATGGG GGTAGAGTCA TAATCGTGCA TCAGGATAGG GCGGTGGTGC TGCAGCAGCG 
CGCGAATAAA CTGCTGCCGC CGCCGCTCCG TCCTGCAGGA ATACAACATG GCAGTGGTCT 
CCTCAGCGAT GATTCGCACC GCCCGCAGCA TAAGGCGCCT TGTCCTCCGG GCACAGCAGC 
GCACCCTGAT CTCACTTAAA TCAGCACAGT AACTGCAGCA CAGCACCACA ATATTGTTCA 
AAATCCCACA GTGCAAGGCG CTGTATCCAA AGCTCATGGC GGGGACCACA GAACCCACGT 
GGCCATCATA CCACAAGCGC AGGTAGATTA AGTGGCGACC CCTCATAAAC ACGCTGGACA 
TAAACATTAC CTCTTTTGGC ATGTTGTAAT TCACCACCTC CCGGTACCAT ATAAACCTCT 
GATTAAACAT GGCGCCATCC ACCACCATCC TAAACCAGCT GGCCAAAACC TGCCCGCCGG 
CTATACACTG CAGGGAACCG GGACTGGAAC AATGACAGTG GAGAGCCCAG GACTCGTAAC 
CATGGATCAT CATGCTCGTC ATGATATCAA TGTTGGCACA ACACAGGCAC ACGTGCATAC 
ACTTCCTCAG GATTACAAGC TCCTCCCGCG TTAGAACCAT ATCCCAGGGA ACAACCCATT 
CCTGAATCAG CGTAAATCCC ACACTGCAGG GAAGACCTCG CACGTAACTC ACGTTGTGCA 
TTGTCAAAGT GTTACATTCG GGCAGCAGCG GATGATCCTC CAGTATGGTA GCGCGGGTTT 
CTGTCTCAAA AGGAGGTAGA CGATCCCTAC TGTACGGAGT GCGCCGAGAC AACCGAGATC 
GTGTTGGTCG TAGTGTCATG CCAAATGGAA CGCCGGACGT AGTCATATTT CCTGAAGCAA 
AACCAGGTGC GGGCGTGACA AACAGATCTG CGTCTCCGGT CTCGCCGCTT AGATCGCTCT 
GTGTAGTAGT TGTAGTATAT CCACTCTCTC AAAGCATCCA GGCGCCCCCT GGCTTCGGGT 
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TCTATGTAAA 


CTCCTTCATG 


CGCCGCTGCC 


CTGATAACAT 


CCACCACCGC 


AGAATAAHPP 




ACACCCAGCC 


AACCTACACA 


TTCGTTCTGC 


GAGTCACACA 


CGGGAGGAGC 


GGGAAGAGPT 




GGAAGAACCA 


TGTTTTTTTT 


TTTATTCCAA AAGATTATCC AAAACCTCAA 


AATGAAGATP 


/ bU 


TATTAAGTGA 


ACGCGCTCCC 


CTCCGGTGGC 


GTGGTCAAAC 


TCTACAGCCA 


AAGAACAGAT 


"l 9 p *5 n 


AATGGCATTT 


GTAAGATGTT 


GCACAATGGC 


TTCCAAAAGG 


CAAACGGCCC 


TCACGTCCAA 




GTGGACGTAA 


AGGCTAAACC 


CTTCAGGGTG 


AATCTCCTCT 


ATAAACATTC 


CAGCACCTTC 




AACCATGCCC 


AAATAATTCT 


CATCTCGCCA 


CCTTCTCAAT 


ATATCTCTAA GCAAATCCCG 


1 t n n n 


AATATTAAGT 


CCGGCCATTG 


TAAAAATCTG 


CTCCAGAGCG 


CCCTCCACCT 


TCAGCCTCAA 


J 3060 


GCAGCGAATC 


ATGATTGCAA 


AAATTCAGGT 


TCCTCACAGA 


CCTGTATAAG 


ATT CAAAAG P 


3 3120 


GGAACATTAA 


CAAAAATACC 


GCGATCCCGT 


AGGTCCCTTC 


GCAGGGCCAG 


CTGAACATAA 


33180 


TCGTGCAGGT 


CTGCACGGAC 


CAGCGCGGCC 


ACTTCCCCGC 


CAGGAACCTT 


GACAAAAGAA 


33240 


CCCACACTGA 


TTATGACACG 


CATACTCGGA 


GCTATGCTAA 


CCAGCGTAGC 


CCCGATGTAA 


333 00 


GCTTTGTTGC 


ATGGGCGGCG 


ATATAAAATG 


CAAGGTGCTG 


CTCAAAAAAT 


CAGGC AAAfi P 


33360 


CTCGCGCAAA 


AAAGAAAGCA 


CATCGTAGTC 


ATGCTCATGC 


AGATAAAGGC 


AG GTAAG P TP 


33420 


CGGAACCACC 


ACAGAAAAAG 


ACACCATTTT 


TCTCTCAAAC 


ATGTCTGCGG 


GTTTPTfiPAT 


33480 


AAACACAAAA 


TAAAATAACA 


AAAAAACATT 


TAAACATTAG 


AAGCCTGTCT 


TACAAPAPGA 


"5 ^ c >i r\ 

33540 


AAAACAACCC 


TTATAAGCAT 


AAGACGGACT 


ACGGCCATGC 


CGGCGTGACC 




33600 


TGGTCACCGT 


GATTAAAAAG 


CACCACCGAC 


AGCTCCTCGG 


TCATGTCCGG 




3366 0 


TAAGACTCGG 


TAAACACATC 


AGGTTGATTC 


ATCGGTCAGT 


GCTAAAAAGC 




33 /20 


GCCCGGGGGA 


ATACATACCC 


GCAGGCGTAG 


AGACAACATT 


ACAGCCCCCA 


TAGGAGGTAT 


33780 


AACAAAATTA 


ATAGGAGAGA 


AAAACACATA AACACCTGAA AAACCCTCCT 


GCCTAGGCAA 


33840 


AATAGCACCC 


TCCCGCTCCA 


GAACAACATA 


CAGCGCTTCA 


CAGCGGCAGC 


CTAACAGTCA 


T ^ AAA 

33900 


GCCTTACCAG 


TAAAAAAGAA 


AACCTATTAA 


AAAAACACCA 


CTCGACACGG 


CACCAGCTCA 


33960 


ATCAGTCACA 


GTGTAAAAAA 


GGGCCAAGTG 


CAGAGCGAGT 


ATATATAGGA 


CTAAAAAATG 


34020 


ACGTAACGGT 


TAAAGTCCAC 


AAAAAACACC 


CAGAAAACCG 


CACGCGAACC 


TACGCCCAGA 


34080 


AACGAAAGCC 


AAAAAACCCA 


CAACTTCCTC 


AAATCGTCAC 


TTCCGTTTTC 


CCACGTTACG 


34140 


TAACTTCCCA 


TTTTAAGAAA 


ACTACAATTC 


CCAACACATA 


CAAGTTACTC 


CGCCCTAAAA 


34200 


CCTACGTCAC 


CCGCCCCGTT 


CCCACGCCCC 


GCGCCACGTC 


ACAAACTCCA 


CCCCCTCATT 


34260 


ATCATATTGG 


CTTCAATCCA 


AAATAAGGTA 


TATTATTGAT 


GAT 




34303 


(2) INFORMATION FOR SEQ ID NO: 5: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 380 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc ■ "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
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CCAGGCTTTA 


CACTTTATGC 


TTCCGGCTCG 


TATGTTGTGT 


GGAATTGTGA 


PCT/US97/19541 

GCGGATAACA 60 


ATTTCACACA 


GGAAACAGCT 


ATGACCATGA 


TTACGCCAAG 


CGCGCAATTA 


ACCCTCACTA 


120 


AAGGGAACAA 


AAGCTGGGTA 


CCGGGCCCCC 


CCTCGAGGTC 


GACGGTATCG 




180 


GCGTGGCCTA 


GGCGGCCGAA 


TTCCTGCAGC 


CCGGGGGATC 




AGAGCGGCCG 


240 


CCACCGCGGC 


GCCTTAATTA 


ATACGTAAGC 


TCCAATTCGC 


CCTATAGTGA 


GTCGTATTAC 


300 


GCGCGCTCAC 


TGGCCGTCGT 


TTTACAACGT 


CGTGACTGGG 


AAAACCCTGG 


CGTTACCCAA 


360 


CTTAATCGCC 


TTGCAGCACA 










380 



(2) INFORMATION FOR SEQ ID N0:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

TGCCGCAGCA CCGGATGCAT C 21 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

{ C ) STRANDEDNESS : s ingle 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 
GCGTCCGGAG GCTGCCATGC GGCAGGG 27 
(2) INFORMATION FOR SEQ ID NO : 8 : 

• (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1481 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc » "DNA" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

GGCTGCCATG CGGCAGGGAT ACGGCGCTAA CGATGCATCT CAACAATTGT TGTGTAGGTA 60 

CTCCGCCGCC GAGGGACCTG AGCGAGTCCG CATCGACCGG ATCGGAAAAC CTCTCGAGAA 120 

AGGCGTCTAA CCAGTCACAG TCGCAAGGTA GGCTGAGCAC CGTGGCGGGC GGCAGCGGGC 180 

GGCGGTCGGG GTTGTTTCTG GCGGAGGTGC TGCTGATGAT GTAATTAAAG TAGGCGGTCT 240 

TGAGACGGCG GATGGTCGAC AGAAGCACCA TGTCCTTGGG TCCGGCCTGC TGAATGCGCA 300 

GGCGGTCGGC CATGCCCCAG GCTTCGTTTT GACATCGGCG CAGGTCTTTG TAGTAGTCTT 360 

GCATGAGCCT TTCTACCGGC ACTTCTTCTT CTCCTTCCTC TTGTCCTGCA TCTCTTGCAT 420 
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CTATCGCTGC GGCGGCGGCG 
TGACCCCGAA GCCCCTCATC 
ATATGGCCTG CTGCACCTGC 
GGTATGCGCC CGTGTTGATG 
GGTGACCCGG CTGCGAGAGC 
CGTAGTCGTT GCAAGTCCGC 
GGCGGTAGAG GGGCCAGCGT 
GGCGATGATA TCCGTAGATG 
CGCGCGGAAA GTCGCGGACG 
TCGGGACGCT CTGGCCGGTC 
AGCCTGTAAG CGGGCACTCT 
GACGACCGGG GTTCGAGCCC 
GCGTGTCGAA CCCAGGTGTG 
CCAGGCGCGG CGGCTGCTGC 
TTAGGCTGGA AAGCGAAAGC 
AAGGGTTGAG TCGCGGGACC 
GGGTTTGCCT CCCCGTCATG 
CCCCTTTTTT GCTTTTCCCA 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3364 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 



GAATTCCCCA 


TCCTGGTCTA 


TAGAGAGAGT 


TCCAGAACAG 


CCAGGGCTAC 


AGATAAACCC 


60 


ATCTGGAAAA 


ACAAAGTTGA 


ATGACCCAAG 


AGGGGTTCTC 


AGAGGGTGGC 


GTGTGCTCCC 


120 


TGGCAAGCCT 


ATGACATGGC 


CGGGGCCTGC 


CTCTCTCTGC 


CTCTGACCCT 


CAGTGGCTCC 


160 


CATGAACTCC 


TTGCCCAATG 


GCATCTTTTT 


CCTGCGCTCC 


TTGGGTTATT 


CCAGTCTCCC 


240 


CTCAGCATTC 


CTTCCTCAGG 


GCCTCGCTCT 


TCTCTCTGCT 


CCCTCCTTGC 


ACAGCTGGCT 


300 


CTGTCCACCT 


CAGATGTCAC 


AGTGCTCTCT 


CAGAGGAGGA 


AGGCACCATG 


TACCCTCTGT 


360 


TTCCCAGGTA 


AGGGTTCAAT 


TTTTAAAAAT 


GGTTTTTTGT 


TTGTTTGTTT 


GTTTGTTTGT 


420 


TTGTTTGTTT 


TTCAAGACAG 


GGCTCCTCTG 


TGTAGTCCTA 


ACTGTCTTGA 


AACTCCCTCT 


480 


GTAGACCAGG 


TCGACCTCGA 


ACTCTTGAAA 


CCTGCCACGG 


ACCACCCAGT 


CAGGTATGGA 


540 


GGTCCCTGGA 


ATGAGCGTCC 


TCGAAGCTAG 


GTGGGTAAGG 


GTTCGGCGGT 


GACAAACAGA 


600 


AACAAACACA 


GAGGCAGTTT 


GAATCTGAGT 


GTATTTTGCA 


GCTCTCAAGC 


AGGGGATTTT 


660 
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GAGTTTGGCC 


GTAGGTGGCG 


CCCTCTTCCT 


CC CATC? PPTP. 


A Q A 


GGCTGAAGCA 


GGGCTAGGTC 


GGCGACAACG 


CG CTCGGCT A 


Jt v 


GTGAGGGTAG 


ACTGGAAGTC 


ATCCATGTCC 


AC AAAG CGGT 


ouu 


GTGTAAGTGC 


AGTTGGCCAT 


AACGGACCAG 


TTAACGGTPT 




TCGGTGTACC 


TGAGACGCGA 


GTAAGCCCTC 

* * *• *.^mf W >w mm V* 


GAGTPAAAT21 


/ 20 


ACCAGGTACT 


GGTATCCCAC 


CAAAAAGTGC 


w\jv».\j\jv».\jvjVw A 


/so 


AGGGTGGCCG 


GGGCTCCGGG 


GGCGAGATCT 




840 


TACCTGGACA 


TCCAGGTGAT 


GCCGGCGGCG 

^m* 1 Sm* V#* V»» \m9 ^mt V» XpJ 


GTGGTP.P. 21 PP 


Q f\ 

900 


CGGTTCCAGA 


TGTTGCGCAG 


CGGCAAAAAG 


TGPTPP & TPP 


960 


AGGCGCGCGC 


AATCGTTGAC 


GCTCTACCGT 


G C! A A a n an a p 


1020 


TCCGTGGTCT 


GGTGGATAAA 


TTCGCAAGGG 


TATPZaTpp/t* 
x i-i x \-/\ i vavjv»vj 


i r\ a ri 

1080 


CGTATCCGGC 


CGTCCGCCGT 


GATCCATGCG 


GTTZi PPPPPP 


114 0 


CGACGTCAGA 


CAACGGGGGA 


GTGCTCCTTT 




1200 


GCTAGCTTTT 


TTGGCCACTG 

mm ^mr ~mmr mmr ■ mmm 4m W 


GCCGCGCGPA 


PPPTZi T^ct^r'r* 

vjv-vj 1 >i/i\jv-ljvj 


1260 


All AAvj 1 vao L 


TCGCTCCCTG 


f m m -m &m\ jmm ^mm m*. 

TAGCCGGAGG 


GTTATTTTCC 


1320 


CCCGGTTCGA 


GTCTCGGACC 


GGCCGGACTG 


CGGCGAACGG 


1380 


CAAGACCCCG 


CTTGCAAATT 


CCTCCGGAAA 


CAGGGACGAG 


1440 


GATGCATCCG 


GTGCTGCGGC 


A 




1481 
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ATACATAAAA AAAAAAAAAA AAAAAAAACC AAACATTACA TCTCTTAGAA ACTATATCCA 720 
ATGAAACAAT CACAGATACC AACCAAAACC ATTGGGCAGA GTAAAGCACA AAAATCATCC - 780 

AAGCATTACA ACTCTGAAAC CATGTATTCA GTGAATCACA AACAGAACAG GTAACATCAT 840 
TATTAATATA AATCACCAAA ATATAACAAT TCTAAAAGGA TGTATCCAGT GGGGGCTGTC 900 
GTCCAAGGCT AGTGGCAGAT TTCCAGGAGC AGGTTAGTAA ATCTTAACCA CTGAACTAAC 960 

TCTCCAGCCC CATGGTCAAT TATTATTTAG CATCTAGTGC CTAATTTTTT TTTATAAATC 1020 

TTCACTATGT AATTTAAAAC TATTTTAATT CTTCCTAATT AAGGCTTTCT TTACCATATA 1080 

CCAAAATTCA CCTCCAATGA CACACGCGTA GCCATATGAA ATTTTATTGT TGGGAAAATT 1140 

TGTACCTATC ATAATAGTTT TGTAAATGAT TTAAAAAGCA AAGTGTTAGC CGGGCGTGGT 1200 

GGCACACGCC TTTAATCCCT GCACTCGGGA GGCAGGGGCA GGAGGATTTC TGAGTTTGAG 1260 

GCCAGCCTGG TCTACAGAGT GAGTTCCAGG ACAGCCAGGG CTACACAGAG AAACCCTGTC 132 0 

TCGAACCCCC CACCCCCCAA AAAAAGCAAA GTGTTGGTTT CCTTGGGGAT AAAGTCATGT 1380 

TAGTGGCCCA TCTCTAGGCC CATCTCACCC ATTATTCTCG CTTAAGATCT TGGCCTAGGC 1440 

TACCAGGAAC ATGTAAATAA GAAAAGGAAT AAGAGAAAAC AAAACAGAGA GATTGCCATG 1500 

AGAACTACGG CTCAATATTT TTTCTCTCCG GCGAAGAGTT CCACAACCAT CTCCAGGAGG 1560 

CCTCCACGTT TTGAGGTCAA TGGCCTCAGT CTGTGGAACT TGTCACACAG ATCTTACTGG 1620 

AGGTGGTGTG GCAGAAACCC ATTCCTTTTA GTGTCTTGGG CTAAAAGTAA AAGGCCCAGA 1680 

GGAGGCCTTT GCTCATCTGA CCATGCTGAC AAGGAACACG GGTGCCAGGA CAGAGGCTGG 1740 

ACCCCAGGAA CACCTTAAAC ACTTCTTCCC TTCTCCGCCC CCTAGAGCAG GCTCCCCTCA 1800 

CCAGCCTGGG CAGAAATGGG GGAAGATGGA GTGAAGCCAT ACTGGCTACT CCAGAATCAA 1860 

CAGAGGGAGC CGGGGGCAAT ACTGGAGAAG CTGGTCTCCC CCCAGGGGCA ATCCTGGCAC 1920 

CTCCCAGGCA GAAGAGGAAA CTTCCACAGT GCATCTCACT TCCATGAATC CCCTCCTCGG 1980 

ACTCTGAGGT CCTTGGTCAC AGCTGAGGTG CAAAAGGCTC CTGTCATATT GTGTCCTGCT 2040 

CTGGTCTGCC TTCCACAGCT TGGGGGCCAC CTAGCCCACC TCTCCCTAGG GATGAGAGCA 2100 

GCCACTACGG GTCTAGGCTG CCCATGTAAG GAGGCAAGGC CTGGGGACAC CCGAGATGCC 2160 

TGGTTATAAT TAACCCAGAC ATGTGGCTGC CCCCCCCCCC CCAACACCTG CTGCCTGAGC 2220 

CTCACCCCCA CCCCGGTGCC TGGGTCTTAG GCTCTGTACA CCATGGAGGA GAAGCTCGCT 2280 

CTAAAAATAA CCCTGTCCCT GGTGGATCCA GGGTGAGGGG CAGGCTGAGG GCGGCCACTT 2340 

CCCTCAGCCG CAGGTTTGTT TTCCCAAGAA TGGTTTTTCT GCTTCTGTAG CTTTTCCTGT 2400 

CAATTCTGCC ATGGTGGAGC AGCCTGCACT GGGCTTCTGG GAGAAACCAA ACCGGGTTCT 2460 

AACCTTTCAG CTACAGTTAT TGCCTTTCCT GTAGATGGGC GACTACAGCC CCACCCCCAC 252 0 

CCCCGTCTCC TGTATCCTTC CTGGGCCTGG GGATCCTAGG CTTTCACTGG AAATTTCCCC 2580 

CCAGGTGCTG TAGGCTAGAG TCACGGCTCC CAAGAACAGT GCTTGCCTGG CATGCATGGT 2640 

TCTGAACCTC CAACTGCAAA AAATGACACA TACCTTGACC CTTGGAAGGC TGAGGCAGGG 2700 

GGATTGCCAT GAGTGCAAAG CCAGACTGGG TGGCATAGTT AGACCCTGTC TCAAAAAACC 2760 
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AAAAACAATT AAATAACTAA AGTCAGGCAA GTAATCCTAC TCGGGAGACT GAGGCAGAGG 2820 

GATTGTTACA TGTCTGAGGC CAGCCTGGAC TACATAGGGT TTCAGGCTAG CCCTGTCTAC 2880 

AGAGTAAGGC CCTATTTCAA AAACACAAAC AAAATGGTTC TCCCAGCTGC TAATGCTCAC 2 94 0 

CAGGCAATGA AGCCTGGTGA GCATTAGCAA TGAAGGCAAT GAAGGAGGGT GCTGGCTACA 3000 

ATCAAGGCTG TGGGGGACTG AGGGCAGGCT GTAACAGGCT TGGGGGCCAG GGCTTATACG 3060 

TGCCTGGGAC TCCCAAAGTA TTACTGTTCC ATGTTCCCGG CGAAGGGCCA GCTGTCCCCC 3120 

GCCAGCTAGA CTCAGCACTT AGTTTAGGAA CCAGTGAGCA AGTCAGCCCT TGGGGCAGCC 3180 

CATACAAGGC CATGGGGCTG GGCAAGCTGC ACGCCTGGGT CCGGGGTGGG CACGGTGCCC 3240 

GGGCAACGAG CTGAAAGCTC ATCTGCTCTC AGGGGCCCCT CCCTGGGGAC AGCCCCTCCT 3300 

GGCTAGTCAC ACCCTGTAGG CTCCTCTATA TAACCCAGGG GCACAGGGGC TGCCCCCGGG 3360 
TCAC 

3364 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GGTACCACTA CGGGTCTAGG CTGCCCATGT AAGGAGGCAA GGCCTGGGGA CACCCGAGAT 60 

GCCTGGTTAT AATTAACCCC AACACCTGCT GCCCCCCCCC CCCCAACACC TGCTGCCTGA 12 0 

GCCTGAGCGG TTACCCCACC CCGGTGCCTG GGTCTTAGGC TCTGTACACC ATGG 174 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 699 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

AGATCTTCCT AGAAATTCGC TGTCTGCGAG GGCCGGCTGT TGGGGTGAGT ACTCCCTCTC 60 

AAAAGCGGGC ATGACTTCTG CGCTAAGATT GTCAGTTTCC AAAAACGAGG AGGATTTGAT 120 

ATTCACCTGG CCCGCGGTGA TGCCTTTGAG GGTGGCCGCG TCCATCTGGT CAGAAAAGAC 180 

AATCTTTTTG TTGTCAAGCT TGAGGTGTGG CAGGCTTGAG ATCGATCTGG CCATACACTT 240 

GAGTGACAAT GACATCCACT TTGCCTTTCT CTCCACAGGT GTCCACTCCC AGGTCCAACC 3 00 

GCGGATCTCC CGGGACCATG CCCAAGAAGA AGAGGAAGGT GTCCAATTTA CTGACCGTAC 3 60 

ACCAAAATTT GCCTGCATTA CCGGTCGATG CAACGAGTGA TGAGGTTCGC AAGAACCTGA 420 

TGGACATGTT CAGGGATCGC CAGGCGTTTT CTGAGCATAC CTGGAAAATG CTTCTGTCCG 4 80 
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TTTGCCGGTC GTGGGCGGCA TGGTGCAAGT TGAATAACCG GAAATGGTTT CCCGCAGAAC 540 

CTGAAGATGT TCGCGATTAT CTTCTATATC TTCAGGCGCG CGGTCTGGCA GTAAAAACTA 6 00 

TCCAGCAACA TTTGGGCCAG CTAAACATGC TTCATCGTCG GTCCGGGCTG CCACGACCAA 66 0 

GTGACAGCAA TGCTGTTTCA CTGGTTATGC GGCGGATCC 6 99 
(2) INFORMATION FOR SEQ ID NO : 12 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GAAGATCTAT AACTTCGTAT AATGTATGCT ATACGAAGTT ATTACCGAAG AAATGGCTCG 60 
AGATCTTC 68 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GAAGATCTCG AGCCATTTCT TCGGTAATAA CTTCGTATAG CATACATTAT ACGAAGTTAT 60 
AGATCTTC 68 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CCACATGTAT AACTTCGTAT AGCATACATT ATACGAAGTT ATACATGTGG 50 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CCACATGTAT AACTTCGTAT AATGTATGCT ATACGAAGTT ATACATGTGG 50 

■ 
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1 . A recombinant plasmid, comprising in operable combination: 

a) a plasmid backbone, comprising an origin of replication, an 
antibiotic resistance gene and a eukaryotic promoter element; 

b) the left and right inverted terminal repeats (ITRs) of 
adenovirus, said ITRs each having a 5 ? and a 3' end and arranged in a tail 
to tail orientation on said plasmid backbone: 

c) the adenovirus packaging sequence, said packaging sequence 
having a 5 ? and a 3' end and linked to one of said ITRs; and 

d) a first gene of interest operably linked to said promoter 
element. 



2. The recombinant plasmid of Claim 1 wherein the total size of said 
plasmid is between 27 and 40 kilobase pairs. 

3. The recombinant plasmid of Claim 1. wherein said 5 ? end of said 
15 packaging sequence is linked to said 3" end of said left ITR. 

4. The recombinant plasmid of Claim 3, wherein said first gene of 
interest is linked to said 3 ? end of said packaging sequence. 

5. The recombinant plasmid of Claim 4. wherein said first gene of 
interest is the dystrophin cDNA gene. 

-° 6 - The recombinant plasmid of Claim 4, further comprising a second 

gene of interest. 

7. The recombinant plasmid of Claim 6. wherein said second gene of 
interest is linked to said 3* end of said right ITR. 



15 



8. The recombinant plasmid of Claim 7. wherein said second gene of 
interest is a reporter gene. 
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9. The recombinant plasmid of Claim 8. wherein said reporter gene is 
selected from the group consisting of the £. coli P-galactosidase gene, the human 
placental alkaline phosphatase gene, the green fluroescent protein gene and the 
chloramphenicol acetyltransferase gene. 

5 10. A mammalian cell line containing the recombinant plasmid of 

Claim 1. 



11. The cell line of Claim 10, wherein said cell line is a 293-derived cell 



line. 



12. A helper adenovirus comprising i) first and a second ioxP sequences. 
10 and ii) the adenovirus packaging sequence, said packaging sequence having a 5 ? 

r 

and a 3* end. 

13. The helper adenovirus of Claim 12, wherein said first IoxP sequence 
is linked to the 5" end of said packaging sequence and said second IoxP sequence is 
linked to said 3' end of said packaging sequence. 

15 14. The helper adenovirus of Claim 12, further comprising at least one 

adenovirus gene coding region. 

15. A mammalian cell line, comprising: 

a) a recombinant plasmid. comprising, in operable combination: 

i) a plasmid backbone, comprising an origin of 

20 replication, an antibiotic resistance gene and a eukaryotic promoter 

element, 

ii) the left and right inverted terminal repeats (ITRs) of 
adenovirus, said ITRs each having a 5' and a 3' end and arranged in 
a tail to tail orientation on said plasmid backbone. 

25 iii) the adenovirus packaging sequence, said packaging 

sequence having a 5' and a 3" end and linked to one of said ITRs, 
and 
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iv) a first gene of interest operabiy linked to said 
promoter element: and 

b) a helper adenovirus comprising i) first and a second loxP 
sequences, and ii) the adenovirus packaging sequence, said packaging 
5 sequence having a 5* and a 3' end. 

16. The cell line of Claim 15, wherein said helper adenovirus further 
comprises at least one adenovirus gene coding region. 

17. The cell line of Claim 15, wherein said first gene of interest is the 
dystrophin cDNA gene 

10 1 8. A method of producing an adenovirus minichromosome. comprising: 

A) providing a mammalian cell line containing: 

a) a recombinant plasmid, comprising, in operable 
combination, i) a plasmid backbone, comprising an origin of 
replication, an antibiotic resistance gene and a eukarotic promoter 
element, ii) the left and right inverted terminal repeats (ITRs) of 
adenovirus, said ITRs each having a 5 ! and a 3' end and arranged in 
a tail to tail orientation on said plasmid backbone, iii) the adenovirus 
packaging sequence, said packaging sequence having a 5' and a 3' 
end and linked to one of said ITRs, and iv) a first gene of interest 

20 operabiy linked to said promoter element; and 

b) a helper adenovirus comprising i) first and a second 
loxP sequences, ii) at least one adenovirus gene coding region, and 
iii) the adenovirus packaging sequence, said packaging sequence 
having a 5' and a 3' end; and 

25 B) growing said cell line under conditions such that said 

adenovirus gene coding region is expressed and said recombinant plasmid 
directs the production of at least one adenoviral minichromosome. 

19. The method of Claim 18, wherein said adenovirus minichromosome 
is encapidated. 



15 
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20. The method of Claim 18. further comprising 3) recovering said 
encapsidated adenovirus minichromosome. 



21. The method of Claim 20. further comprising 4) purifiying said 
recovered encapsidated adenovirus minichromosome. 

5 22. The method of Claim 21. further comprisng 5) administering said 

purified encapsidated adenovirus minichromosome to a host. 

23. The method of Claim 22. wherein said host is a mammal. 

24. The method of Claim 23, wherein said mammal is a human. 

25. A recombinant adenovirus comprising the adenovirus E2b region 
10 having a deletion, said adenovirus capable of self-propagation in a packaging cell 

line and said E2b region comprising the DNA polymerase gene and the adenovirus 
preterminal protein gene. 

26. The recombinant adenovirus of Claim 25. wherein said deletion is 
within the adenovirus DNA polymerase gene. 

1 5 27. The recombinant adenovirus of Claim 25. wherein said deletion is 

within the adenovirus preterminal protein gene. 

28. The recombinant adenovirus of Claim 25, wherein said deletion is 
within the adenovirus DNA polymerase and preterminal protein genes. 

29. A mammalian cell line stably and constitutively expressing the 
20 adenovirus El gene products and the adenovirus DNA polymerase. 

30. The cell line of Claim 29. wherein said cell line comprises a 
recombinant adenovirus comprising a deletion within the E2b region, said 
recombinant adenovirus being capable of self-propagation in said cell line. 
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31. The cell line of Claim 30. wherein said deletion within said E2b 
region comprises a deletion within the adenoviral DNA polymerase gene. ' 

32. The cell of Claim 29. wherein the genome of said cell line contains 
a nucleotide sequence encoding adenovirus DNA polymerase operably linked to a 

5 heterologous promoter. 



The cell line of Claim 32, wherein said cell line is selected from the 
group consisting of the B-6, B-9, C-K C-4, C-7, C-13, and C-14 cell lines. 

34. The cell line of Claim 29 further constitutively expressing the 
adenovirus preterminal protein gene product. 

10 35 < The cell line of Claim 34, wherein said cell line comprises a 

recombinant adenovirus comprising a deletion within the E2b region, said 
recombinant adenovirus being capable of self-propagation in said cell line. 

36. The cell line of Claim 35. wherein said deletion within said E2b 
region comprises a deletion within the adenoviral preterminal protein gene. 

15 37 - The cell line of Claim 36, wherein said deletion within said E2b 

region comprises a deletion within the adenoviral DNA polymerase and preterminal 
protein genes. 

38. The cell line of Claim 34, wherein the genome of said cell line 
contains a nucleotide sequence encoding adenovirus preterminal protein operably 

20 linked to a heterologous promoter. 

39. The cell line of Claim 38. wherein said cell line is selected from the 
group consisting of the C-l, C-4. C-7. C-13, and C-14 cell lines. 
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40. A method of producing infectious recombinant adenovirus particles 
containing an adenoviral genome containing a deletion within the E2b region, 
comprising 

a) providing: 

5 i) a mammalian cell line stably and constitutively 

expressing the adenovirus El gene products and the adenovirus DNA 
polymerase; 

ii) a recombinant adenovirus comprising a deletion within 
the E2b region, said recombinant adenovirus being capable of self- 
10 propagation in said cell line; 

b) introducing said recombinant adenovirus into said cell line 
under conditions such that said recombinant adenovirus is propagated to 
form infectious recombinant adenovirus particles: and 

c) recovering said infectious recombinant adenovirus particles. 

15 41 The method of Claim 40, further comprising d) purifying said 

recovered infectious recombinant adenovirus particles. 

42. The method of Claim 41, further comprising e) administering said 
purified recombinant adenovirus particles to a host. 

43. The method of Claim 40, wherein said mammalian cell line further 
20 constitutively expresses the adenovirus preterminal protein. 

44. A recombinant plasmid capable of replicating in a bacterial host 
comprising adenoviral E2b sequences, said E2b sequences containing a deletion 
within the polymerase gene, said deletion resulting in reduced polymerase activity. 

45. The recombinant plasmid of Claim 44. wherein said deletion 
25 comprises a deletion of nucleotides 8772 to 9385 in SEQ ID NO:4. 

46. The recombinant plasmid of Claim 45. wherein said plasmid has the 
designation pApol. 
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47. The recombinant plasmid of Claim 45. wherein said plasmid has the 
designation pBHGl lApol. 

48. A recombinant plasmid capable of replicating in a bacterial host 
comprising adenoviral E2b sequences, said E2b sequences containing a deletion 

5 within the preterminal protein gene, said deletion resulting in the inability to 

express functional preterminal protein without disruption of the VA RNA eenes. 

49. The recombinant plasmid of Claim 48, wherein said deletion within 
said preterminal protein gene comprises a deletion of nucleotides 10,705 to 11,134 
in SEQ ID NO:4. 

1 0 50 - The recombinant plasmid of Claim 49. wherein said plasmid has the 

designation pApTP. 

■ 

51. The recombinant plasmid of Claim 49, wherein said plasmid has the 
designation pBHGHApTP. 

52. The recombinant plasmid of Claim 48 further comprising a deletion 
15 within the polymerase gene, said deletion resulting in reduced polymerase activity. 

53. The recombinant plasmid of Claim 52, wherein said deletion within 
said polymerase and said preterminal protein genes comprises a deletion of 
nucleotides 8,773 to 9586 and 1 1,067 to 12,513 in SEQ ID NO:4. 

54. The recombinant plasmid of Claim 53, wherein said plasmid has the 
20 designation pAXBApolApTPVARNA+U3. 

55. The recombinant plasmid of Claim 53, wherein said plasmid has the 
designation pBHG 1 1 Apol ApTPV ARN A+t 1 3 . 
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